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The Mandelbrot Set, the Farey Tree, 
and the Fibonacci Sequence 


Robert L. Devaney 


1. INTRODUCTION. Our goal is to explain and to make precise several “folk 
theorems” involving the Mandelbrot set and the Farey tree [4]. 

The Mandelbrot set is a subset of the parameter plane for iteration of the 
complex quadratic function Q(z) =z’? +c. Here the parameter c is complex. 
The Mandelbrot set 4 consists of those c values for which the orbit of 0—the 
sequence 0, Q,(0), O.(0,(0)) = Q2(0), 02(0), ... —is bounded. 

One reason for singling out the orbit of 0 is the following important fact from 
complex dynamics: If Q. possesses an attracting cycle, then the orbit of 0, the 
critical point, must converge to that cycle [3]; a cycle is an_ orbit 
Zo, O.(Zy),..., Q"(Zy) = Zp that returns to itself after n iterations. A cycle is called 
attracting if all sufficiently nearby points have orbits that tend to the cycle. 

Since the orbit of 0 tends to any attracting cycle of Q., it follows that Q. admits 
at most one attracting cycle. Also, a c-value for which Q. has an attracting cycle 
must lie in .@ since the orbit of 0 is bounded. In fact, the c-values for which Q. 
has an attracting cycle comprise all of the visible interior of the Mandelbrot set. By 
visible, we mean that nobody has ever found experimentally or otherwise a 
component of the interior that does not have this property. One of the main 
conjectures concerning /# is that its interior consists only of c-values for which 
there is an attracting cycle. | 

The Mandelbrot set features a basic cardioid shape from which hang numerous 
“bulbs” or “decorations”; see Figure 1. Each of these bulbs is a large disk that is 
directly attached to the cardioid, together with numerous other smaller decora- 
tions and a prominent “antenna.” 

Each of these large disks turns out to contain c-values for which Q. admits an 
attracting cycle with period q and rotation number p/gq. That is, the attracting 
cycle of Q. tends to rotate about a central fixed point, turning on average p/q 
revolutions at each iteration. For this reason, this bulb is called the p/gq bulb. 
Each of the c-values in this bulb has essentially the same dynamical behavior. 

A perhaps surprising folk theorem says that we can recognize the p/q-bulb 
from the geometry of the bulb itself. That is, we can read off dynamical informa- 
tion from the geometric information contained in the Mandelbrot set. 

For example, the 2/5 bulb is displayed in Figure 2. For any c-value in this large 
disk, Q. features an attracting cycle with rotation number 2/5. The 2/5 bulb 
possesses an antenna-like structure that features a junction point from which five 
spokes emanate. One of these spokes is attached directly to the 2/5 bulb; we call 
this spoke the principal spoke. Now look at the “smallest” of the non-principal 
spokes. Note that this spoke is located, roughly speaking, 2/5 of a turn in the 
counterclockwise direction from the principal spoke. This is how we identify this 
bulb as the 2/5-bulb. 
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Figure 1. The Mandelbrot set. 


Figure 2. The 2/5 bulb. 


As another example, in Figure 3 we display the 3/7 bulb. This bulb has 7 spokes 
emanating from the junction point, and the smallest is located 3/7 of a turn in the 
counterclockwise direction from the principal spoke. This then is the folk theorem: 
You can recognize the p/q bulb by locating the “smallest”? spoke in the antenna 
and determining its location relative to the principal spoke. Of course, the word 
“smallest” needs some clarification here; our goal is to make this notion precise. 


290 THE MANDELBROT SET [April 


Figure 3. The 3/7 bulb. 


As an additional disclaimer, this folk theorem is only about 80% true using the 
Euclidean notion of “smallness” or Lebesgue measure. We provide a somewhat 
different framework in which this result is always true. 

There is more to the story of interplay between the geometry of the Mandelbrot 
set and the corresponding dynamics. In Figure 4, we display the 1/2 and 1/3 
bulbs. The 1/2 bulb is the large bulb to the left; the 1/3 bulb is the topmost bulb. 
In between these two bulbs are infinitely many smaller bulbs, but the largest we 
recognize as the 2/5 bulb. Now note that 2/5 can be obtained from 1/2 and 1/3 
by “Farey addition”: 


12. 2 

3°35 
As a second example, note that 

2.1 3 

5° 3 8 


and that the 3/8 bulb is the largest between the 2/5 and 1/3 bulbs; see Figure 5. 

That is, to obtain the largest bulb between two given bulbs (in a particular 
ordering), we simply add the corresponding fractions just the way we always 
wanted to add them, namely by adding the numerators and adding the denomina- 
tors. This is the second of the folk theorems we want to discuss. In particular, it 
follows that the size of bulbs is determined by the Farey tree, as we show in 
Section 6. | 


Figures 4 and 5 represent the beginning of a very special sequence of p/q bulbs 
in the Mandelbrot set 
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whose numerators and denominators correspond to the Fibonacci sequence. We 
discuss this connection in more detail in Section 8. 

While we do not give complete proofs of each of these folk theorems, we do 
indicate some of the combinatorial arguments involved in making the statements 
precise. For more folk theorems and complete proofs, see [5]. 


2. THE FAREY TREE. Before discussing the Mandelbrot set, we recall a few facts 
about the Farey tree, which is a tree containing all of the rationals between 0 and 1. 
At each stage of its construction, the Farey tree consists of a finite list of rationals. 
Adjacent rationals in this list are called Farey neighbors. The inductive step in the 
construction of the tree is: Each pair of Farey neighbors produces a Farey child, 
which is the rational between the two whose denominator is the smallest. Natu- 
rally, the rationals that produce a Farey child are called its Farey parents. 

One of the most intriguing features of the Farey tree is that we obtain Farey 
children by Farey addition. That is, the fraction between the Farey neighbors a/B 
and y/6 is given by 

| a Yy at+y 


B 8 BtS 


So, to obtain the fraction between two Farey neighbors whose denominator is the 
smallest, we simply add the numerators and add the denominators of the parents 
to obtain the child. For a proof that this yields the fraction between the parents 
with smallest denominator, we refer to [8]. 

We begin the construction of the tree with the pair of rationals 0 and 1, which 
we write as 0/1 and 1/1. Their child is 1/2, so the second stage of the 
construction gives the list 


0 1 1 
127 
At the next stage we obtain two new Farey children 
0112 1 
t 32 3 4 


The Farey tree contains all rationals; see [8] or [9] for more details. 
One other fact that we use is that a/f and y/6 are Farey neighbors if and only 
if ad — yB = +1. Consequently, we have 


=—. (1) 


This equality is easily proved by induction. 
3. THE MANDELBROT SET. The Mandelbrot set is 
M = {c\Q”"(0) is bounded} 
for Q.(z) = z* +c. Thus @ gives a picture of those c-values for which the orbit of 


0 under Q. does not tend to ~. 
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The visible bulbs in 4 correspond to c-values for which Q. has an attracting 
cycle of some given period. For example, the main central cardioid in 4 consists 
of c-values for which Q. has an attracting fixed point. This can be seen by solving 
for the fixed points (z? + c = z) that are attracting: |Q’.(z)|= |2z| < 1. Solving 
these equations simultaneously, we see that the boundary of this region is given by 
c =z —2z’, where z = 5e7"*, That is, 


1 
c(0) = = e2mib _ 7 


parametrizes the boundary of the cardioid with 0 < 6 <1. At c(@), Q.:g) has a 
fixed point that is neutral; the derivative of Q,,g) at this fixed point is e*”’”. | 

For each rational value of 6, there is a bulb tangent to the main cardioid at 
c(@). For c-values in the bulb attached to the cardioid at c(p/q), Q, has an 
attracting cycle of period g. We call this bulb the p/q bulb attached to the main 
cardioid and denote it by B, ,,. 

It is known that, as c passes from the main cardioid, through c(p /4); and into 
B,/q Q- undergoes a p/q-bifurcation. By this we mean: when c lies in the main 
cardioid near c(p/q), Q. has an attracting fixed point with a nearby repelling 
cycle of period g. At c(p/q) the attracting fixed point and repelling cycle merge to 
produce a neutral fixed point with derivative e*7'?/7. When c lies in B p/q? Qe NOW 
has an attracting cycle of period qg and a repelling fixed point. 

When c = c(p/q), the local (linearized) dynamics are given by rotation through 
angle 27(p/q). As a consequence, for nearby c © B,,,, the attracting cycle 
rotates about the repelling fixed point by jumping approximately 2a( p /q)t radians 
at each iteration. For more details see [2]. 


4. ANGLE DOUBLING MOD 1. To prepare to use the fundamental results of 
Douady and Hubbard [6] regarding the Mandelbrot set we digress to recall some 
facts about the doubling function, which is defined on the circle considered as the 
reals modulo one and is given by D(@) = 26 mod 1. 


Fact 1: The angle 0 is periodic under D if and only if 6 is a rational of the form 
P/q (in lowest terms) with q odd. 


For example, the D-orbit of 1/3 is 
1 2 1 
— — — —_ — —_ eee 5 
3 3 3 
which has period 2. The rational 1/7 has period 3 under doubling: 


1 Z 4 1 


while 1/5 has period 4: 
1 p) 4 3 1 
ee ete titel iteetiee Dita MI 


5 5 5 5 5 


The rationals with even denominator are eventually periodic but not periodic. 
For example, 1/6 lies on an eventual 2-cycle 
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and 1/8 is eventually fixed: 
1 1 1 


3 — Fi => 5 ->1->1-> 

A second important fact about doubling is that we can read off the binary 
expansion of @ by noting the itinerary of 6 in the circle relative to D. To define 
the itinerary, we denote the upper semicircle 0 < 6< 1/2 by J, and the lower 
semicircle 1/2 < 6 < 1 by J,. Given 9, we attach an infinite string of 0’s and 1’s to 
6 as follows: The itinerary of 6 is B(O) = (sqs,5...), where s, is either 0 or 1: 
s; = Oif D/(@) € Ih, and s, = 1 if D/(O) € J,. That is, we simply watch the orbit of 
6 in the circle under doubling and assign 0 or 1 to the itinerary whenever D/(0) 
lands in the arc J, or J,. 


Fact 2: The itinerary B(@) is the binary expansion of 0. 


For example, if 9 = 1/3, then 6 € I, while D(6) € J, and D*(@) = 6. Hence 
B(1/3) is the repeating sequence 01, which of course is the binary expansion of 
1/3. Similarly, B(1/7) = 001 while B(1/5) = 0011. 


5. EXTERNAL RAYS. In order to make precise the folk theorems mentioned in 
the introduction, we recall some beautiful results of Douady and Hubbard [7] 
concerning the external rays of the Mandelbrot set. 

Let E = {z| |z| > 1} denote the exterior of the unit circle in the plane. Accord- 
ing to Douady and Hubbard, there is a unique analytic isomorphism ® that maps 
FE to the exterior of the Mandelbrot set. The mapping ® takes positive reals to 
positive reals. This mapping is the uniformization of the exterior of the Mandel- 
brot set, or the exterior Riemann map. 

The importance of ® stems from the fact that the image under ® of the 
Straight rays 6 = constant in EF have dynamical significance. In the Mandelbrot 
set, we define the external ray with external angle 0, to be the ®-image of @ = 6p. It 
is known that an external ray whose angle 6, is rational actually “lands” on 4. 
That is lim, _, ,®(re?7'*°) exists and is a unique point on the boundary of .4. This 
c-value is called the landing point of the ray with angle 65. 

For example, the ray with angle 0 lies on the real axis and lands on 4 at the 
cusp of the main cardioid, namely c = 1/4. Also, the ray with angle 1/2 lies on 
the negative real axis and lands on 4 at the tip of the “tail” of 4, which can be 
shown to be c = —2. 

Consider now the interior of 4. The interior consists of infinitely many simply 
connected regions. A bulb of 4 is a component of the interior of 4 in which 
each c-value corresponds to a quadratic function that admits an attracting cycle. 
The period of this cycle is constant over each bulb. In many cases, a bulb is 
attached to a component of lower period at a unique point called the root point of 
the component. 

An important result of Douady and Hubbard is: 


Theorem 1. Suppose a bulb B consists of c-values for which the quadratic map has an 
attracting q-cycle. Then the root point of this bulb is the landing point of exactly 2 rays, 
and the angles of each of these rays have period q under doubling. 


Thus, the angles of the external rays of 4 determine the ordering of the bulbs 
in .@. For example, the large bulb directly to the left of the main cardioid is the 
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1/2 bulb, so two rays with period 2 under doubling must land there. Now the only 
angles with period 2 under doubling are 1/3 and 2/3, so these are the angles of 
the rays that land at the root point of B,,.. 

Now consider the 1/3 bulb atop the main cardioid. This bulb lies “between” the 
rays 0 and 1/3. There are only two angles between 0 and 1/3 that have period 3 
under doubling, namely 1/7 and 2/7, so these are the rays that land at the root 
point of B, ,,. 

The 2/5 bulb lies between the 1 /3 and 1/2 bulbs. Hence the rays that land at 
c(2/5) must have period 5 under doubling and lie between 2/7 and 1/3. The only 
angles that have this property are 9/31 and 10/31, so these rays must land at 
c(2/5); see Figure 6. | 


9/31 2/1 1/7 
10/31 2/15 


me 1/15 


1/31 


Figure 6. Rays landing on the Mandelbrot set. 


These ideas allow us to measure the “largeness” or “smallness” of portions of 
the Mandelbrot set. Suppose we have two rays with angles @_ and 6, that both 
land at a point c, in the boundary of 4. 

Then, by the isomorphism ®, all rays with angles between 0_ and 6, must 
approach the component of M — {c,} cut off by 6_ and @,. (It is not known that 
all such rays actually land on .@— indeed, this is the major open conjecture about 
é.) Thus it is natural to measure the size of this portion of @ by the length of the 
interval [6_, 6, ]. 

The root point of the p/q bulb of 4 divides .Z into two sets. The component 
containing the p/q bulb is called the p/q limb. We can then measure the size of 
the p/gq limb if we know the external rays that land on the root point of the p/q 
bulb. We compute these rays in the next section. 
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6. RAYS LANDING ON THE p/q BULB. In order to make the notion of “large” 
or “small” precise in the statement of the folk theorems, we need a way to 


determine the angles of the rays landing at the root point of B, ,,. We denote the 


angles of these two rays in binary by /,(p/q), where /_( p/q)<1/,(p/q). We 
call /_( p/q) the lower angle of B,,, and!,(p/q) the upper angle. 

As we will see, /,(p/q) is a string of q digits (0 or 1) and sol, (p/q) denotes 
the infinite repeating sequence whose basic block is /,(p/q). Douady and Hub- 
bard [6] have a geometric method involving Julia sets to determine these angles. 
Our method is more combinatorial and resembles algorithms due to Atela [1], 
LaVaurs [10], and Lau and Schleicher [11]. 

To describe this algorithm, let R,,, denote rotation of the unit circle through 
p/gq turns, Le., 


R,,q(9) = e2THO+p /q) 


We consider the itineraries of points in the unit circle under R using two different 
partitions of the circle. 
The lower partition of the circle is defined as follows. Let [> = {9|]0< @< 

1 — p/q} and I, = {6|1 — p/q < 6 < 1}. The boundary point 0 belongs to J; and 
—p/q=1-p/q belongs to Ij. We then define s_( p/q) to be the itinerary of 
p/q under R,,,, relative to this partition. We call the basic repeating block of this 
itinerary, s_(p/q), the lower itinerary of p/q. That is, s_(p/q) = s,...8, where s, 
is either 0 or 1 and the digit s, is 0 if and only if RJ/((p/q) © I). Otherwise, 

.= 1. 

’ For example, s_(1/3) = 001 since 

I5=(0,2/3], 17 =(2/3,1], 


and the orbit } > +> 17 4-> - lies in J), 1), 1,, respectively. 
Similarly, s_(2/5) = 01001 since 
Iy=(0,3/5], 1 =(3/5,1], 
and the orbit is > $27 37370747 
We also define the upper partition I and I; as follows: 


Iy=[0,1- p/q), If=[1-p/q,1). 


The upper itinerary of p/q, s.(p/q), is then the repeating block of the itinerary of 
p/q relative to this partition. Note that Jj and J; differ from Jj and J; only at 
the endpoints. — 

For example, s, (1/3) = 010 since the orbit is } > => 0--- and 


I{= [0,2/3), f= [2/3,1). 


This orbit starts in J, hops to J;*, and then returns to J; before cycling. For 2/5, 
we have 

Iy=[|0,3/5), JIf=[3/5,1) 
and s,(2/5) = 01010. 


The following theorem provides an algorithm for computing the angles of rays 
landing at c(p/q). For a proof, we refer to [5] and [6]. 


Theorem 2. The rays 1 ,(p/q) landing at the root point c(p/q) of the p/q bulb are 
given by s_(p/q) and s,( p/@q). 
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Note that s,(p/q) differ only in their last two digits (provided q > 2). Indeed 

we may write 

s_(p/q) — Sy... Sg-20 1 

Si (p/q) = 81--+84-710 
The reason for this is that the upper and lower itineraries are the same except at 
R}7q(P/Q) = —p/q and R47 (p/q) = 0, which form the endpoints of the two 
partitions of the circle. 

We now define the size of the p/q limb to be the length of the interval 
[s_(p/q), s,(p/q)l. That is, the size of the p/q limb is given by the number of 
external rays that approach. this limb. We may compute size of these bulbs 
explicitly by using the fact that s,(p/q) differ only in the last two digits. 


Theorem 3. Zhe size of the p/q limb is 1/(24 — 1). That is 


eo ee 1 
$+(P/4) — 8-(P/4) = 5 - (2) 


Proof: We write the binary expansion of the difference in the form 


ae 1 1 1 
S,(P/4) — 8_(e/4) = sy=7 + Sagar + Gag=T 
1 
re lag + 53g + 53 T 
1 2! 1 2! 1 


AS we see in Figure 7, the visual size of the bulbs does indeed correspond to the 
size as defined above. 3 


7. THE SIZE OF LIMBS AND THE FAREY TREE. In this section we relate the 
size of a p/gq limb to the size of the limbs corresponding to the Farey parents of 
p/q. The following proposition relates the upper and lower itineraries of p/q and 
its Farey parents. 


Proposition 1. Suppose a/B and y/6 are the Farey parents of p/q and that 
0 < a/B < y/6 < 1. Then the lower itinerary s_(p/q) consists of the first q digits of 
the upper angle s,(a/B) of the smaller parent, and the upper itinerary s,(p/q) 
consists of the first q digits of the lower angle s_(y/6) of the larger parent. 


Proof: We consider only s,(p/q); the proof for s_(p/q) is similar. 
From (1), we have 


y p 1 


8 q gh 
Consider the orbits of p/q and y/6 relative to the respective rotations R, ,, and 
R75. Since y/6 rotates faster than p/q, the distance between these orbits 
advances by 1/q6 at each iteration. We thus have 
| i+ 1 


| | j 
Ry3(¥/8) — Rh g(P/G) = Ga 
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1/31 a 


Figure 7. Size of the 2/5 and 1/3 limbs of 4. 


It follows that Ri ,,(p/q) lies within 1/6 units of RJ ,;(y/5) provided j < q — 1. 
Since points on the orbit of y/6 under R,,,; lie exactly 1/6 units apart on the 
circle, it follows that the first gq — 1 entries in the itineraries of p/q and y/6 are 
the same, provided we choose the lower itinerary for y/6 and the upper itinerary 
for p/q. The reason for this is that the orbit of y/6 lies ahead of that of p/q in 
the counterclockwise direction, but by no more than 1/6 units. Choosing the 
upper itinerary for p/q and the lower for y/6 forces the corresponding digits to 
be the same. 
When j = q — 1, we have R, ,,(p/q) = 0 and 


1 
R473(¥/8) — R37 4(P/4) = — = =: 


q 
qo 3) 


Hence 
—]1 1 
Ry73(¥/5) = =. 


Therefore the qth digit in s,(p/q) is 0 and so is the qth digit of y/6, as long as 
y/6 # 1. ia 


If one of the Farey parents is 0 or 1, we must modify Proposition 1. 


Proposition 2. Suppose that 0 is a Farey parent of p /q. Then the q digits in the lower 
itinerary of p/q are s_(p/q) =0...01. If 1 is a Farey parent of p/q then s,(p/q) 
= 1...10. 
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Proof: For s_(p/q), we first note that, since 0/1 is a Farey parent, we must have 
p =1. Thus, s_(p/q) is given by the itinerary of 1/g under counterclockwise 
rotation by 1/q units. We therefore have 


Iy=(0,(¢-1)/q], Ip=((a-1)/¢, 1]. 


It follows that the first g — 1 digits of s_(1/q) are 0, and the last digit is 1. If a 
Farey parent is 1/1, the proof is similar, since in this case p = q — 1. Bal 


We now complete the proof of one of the folk theorems mentioned in the 
introduction. 


Theorem 4. Suppose a/B and y/6 are the Farey parents of p/q and that 0 < a/B 
< y/6 < 1. Then the size of the p/q limb is larger than the size of any other limb 
between the a/B and y/6 limbs. 


Proof: Assume first that neither parent is 0 or 1. Propositions 1 and 2 ensure that 
s_(p/q) ands,(a/B) agree in their first g digits. Using these binary representa- 
tions, we have 


ee Be ee 1 
s_(p/q) —5,(@/B) < 74° 


Similarly 


eet: eee, | 
s_(y/6) — s,(p/q) < 74° 


This implies that the arc of rays between the p/q limb and either of its parents’ 
limbs has length no larger than 1/27. Thus any limb between them has size smaller 
than 1/24. 

From (2), we know that. 


Be eS | ee 1 

Si (P/4) — 8_(P/4) = 557: 
As this quantity is larger than 1/27, it follows that the p/q limb attracts the 
largest number of rays between its two parents. 

If one of the parents of 1/q is 0, then the size of the 1/q bulb is again 
1/(24 — 1), while the gap between 0 and s_( p/q)=0...01 is also 1/(24 — 1). 
But then any limb between the 1/q limb and the cusp of the cardioid must have 
size strictly smaller than 1/(27 — 1), again showing that the 1/q limb is the largest. 
The case of Farey parent 1 is handled similarly. | 


8. THE FIBONACCI SEQUENCE. Theorem 4 shows that the Fibonacci sequence 
appears in the Mandelbrot set. As we have seen in Figures 4 and 5, the largest 
bulb between the 1/2 and 1/3 bulb is the 2/5 bulb, and the largest between the 
1/3 and 2/5 bulbs is the 3/8 bulb. This progression continues, with the numera- 
tors (and denominators) forming the Fibonacci sequence. For example, we next 
have 


@-=—., 
5 8 13 


The corresponding bulbs are shown in Figure 8. 
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5 


Figure 8. © ;= =. 


This sequence of bulbs actually converges to a single point on the boundary of 
the main cardioid. At this particular c-value, Q. is known to have a fixed point Zp 
with Q'(z,) = e’7’’, where 6 is related to the golden ratio and hence is highly 
irrational. The dynamics of complex functions near such fixed points is the subject 
of the Fields’ Medal work of J.-C. Yoccoz [12]. 


9. CONCLUSION. The technique of measuring the size of certain portions of the 
Mandelbrot set by the length of the interval of rays that land on that portion 
provides justification for other folk theorems involving the size of .@. For example, 
this technique is used to identify the p/q bulb using the “lengths” of the spokes in 
its antenna. Once we know these rays, we can easily compute the lengths of the 
various spokes. 

For example, it can be shown that the two rays that land at the junction point of 
the antenna adjacent to the principal spoke are given by s_s, and s,s_, where we 
have dropped the p/q for clarity. These two rays are therefore given by preperi- 
odic binary sequences that begin to repeat only after the qth entry. Thus, the vast 
majority of rays that land on the p/q limb actually approach the spokes of the 
antenna. For we have the following ordering of the rays landing on the p/q bulb: 


S_<S_S, <S,S5S_ <S,. 
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It is easy to check that the length of the arc of rays approaching the antenna 
between s_s, and s,s_ is 


1 f 
29-1 -2.9(29-1) ° 


This number is much larger (for large q) than the length of the arc between s_ and 
s_s, or between s, and s,s_, each of which has length 


1 
29(29-1) 


We can also use the two rays separating the principal spoke from the rest of the 
antenna to determine a list of the qg rays that land on the junction point. Then we 
can determine that the shortest is located p/q turns in the counterclockwise 
direction from the principal spoke. See [5] for details. 
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Existence Proofs 


Fred Richman 


[If] a proof convinces you that there is a root of an equation (without giving 
you any idea where )—how do you know that you understand the proposition 
that there is a root? | Ludwig Wittgenstein 


The proposition that mathematics is grounded on computation would seem to be 
quite uncontroversial; the only people I have heard argue against it are mathemati- 
cians. By “mathematics” I mean pure mathematics—theorems and proofs. Many 
pure mathematicians think that they engage in a high art form that is incompatible 
with strong links to computation, the nerdy province of bookkeepers, statisticians, 
and calculators. This attitude goes hand in glove with the practically unquestioned 
acceptance of nonconstructive existence proofs in modern mathematics. 

Constructive mathematicians are unsatisfied by nonconstructive existence proofs: 
proofs that attempt to convince you of the existence of a number, or of some more 
complicated mathematical object, without giving any method for computing it. The 
difference between constructive mathematics and classical mathematics is that 
when a constructive mathematician says there is a number that satisfies a given 
equation, or has some other property, he has an algorithm in his pocket for 
computing that number. The pocket of a classical mathematician, who makes the 
same statement, might contain only a derivation of a contradiction from the 
assumption that every number fails to have the property. 

The quotation from Wittgenstein [12, p. 282] at the head of this article suggests 
that the peculiar nature of such a proof should cause us to reconsider the meaning 
of the proposition it proves. Did we really understand what was meant by “there is 
a root” if we can be convinced of its truth by an argument that does not provide a 
method for finding a root? What are these numbers that exist without our being 
able to construct them? I am reminded of the Nobel laureate Eugene Wigner’s 
response to the question of whether there are any inherently unknowable laws of 
physics: he said, “I don’t know of any.” 

I will illustrate the idea of a nonconstructive proof by several examples. The 
first is of a theorem that admits a celebrated constructive proof (the Euclidean 
algorithm). Many textbooks give both proofs, but the constructive one is usually 
presented as a method of computation rather than a proof. 


Theorem 1. There exist integers s and t such that 437s + 323t is positive and divides 
437 and 323. 


Of course the numbers 437 and 323 are not special; I choose them to make sure 
we see immediately which are the constants and which are the variables. It follows 
immediately from the theorem that the number 437s + 323t is the greatest 
common divisor of 437 and 323. Here is a nonconstructive proof of the theorem. 
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Proof: Consider the set of positive integers 
S = {d:d > Oand d = 437s + 323t for some s and f}. 


The set § is nonempty because 437 and 323 are clearly in it. Therefore, by what is 
often called the ‘“well-ordering principle,” there is a least element d, of S. As 
dy <= S we can write d, = 437s) + 323t,). The division algorithm enables us to 
write 437 = qd, +r, where 0<r<d,. If r>0, then r= 4370 — qgs,) + 
323(—qt,) is in S. But d, is the smallest element of S, so r = 0. Thus d, divides 
437; similarly d, divides 323. | 


That was a proof in the spirit of the Wittgenstein quotation. What are these 
integers s and ¢ that we have shown to exist? We have not been given a clue as to 
how to find them. They were constructed by choosing the smallest element of the 
set S$. But how do we find that smallest element? What is the basis for the 
well-ordering principle? — | 

A proof of the well-ordering principle, as applied to a set S of positive integers 
that contains 323, might go as follows. If 1 < S, then 1 is the smallest element of S. 
If 1 € S, but 2 € S, then 2 is the smallest element of S. If 1 € S and 2 € S, but 
3 € §, then 3 is the smallest element of S. We can write this as a computer | 
program as follows: for i = 1 to 323 do if i © S return i. 

What does this proof of the well-ordering principle prove? It proves that a 
nonempty detachable subset of the positive integers has a least element. A subset 
S of a set X is called detachable if you can tell (there is an algorithm for telling) 
whether or not any given element of X is in S. The problem with the set S in the 
proof of Theorem 1 is that we have not established that it is detachable; for 
example, how do we decide whether or not 1 € S? In fact S is detachable, but to 
prove that we usually invoke Theorem 1! 

A constructive proof of Theorem 1 might go like this. Consider the following 
table. 


437 1 0 
323 0 1 
114 1 -l 
oy “SZ 3 
19 3 —4 


Rach row represents values of d, s, and t such that d = 437s + 323t. This is clear 
for the first two rows. Each subsequent row R,,, is computed from the previous 
two rows R,_, and R, by setting R,,, = R,_, — m,R,. We choose the m, so as 
to make the first entry of R,,, positive, and as small as possible. The first entries 
of the rows must decrease, unless the first entry of R, divides the first entry of 
R,,-1, in which case we stop. The equation d = 437s + 323¢ is inherited by R,,, 
from R,, and R,,_,. Moreover, as R,_, =m, R, + R,,, any number that divides 
the first entries of two consecutive rows, divides the first entry of the row before 
them. So 19 divides all the numbers above it; in particular, 437 and 323. 

Errett Bishop, the mathematician most responsible for the recent renaissance of 
constructive mathematics, formulated four principles of constructive mathematics 
in [2]: 


(A) Mathematics is common sense. 

(B) Do not ask whether a statement is true until you know what it means. 
(C) A proof is any completely convincing argument. 

(D) Meaningful distinctions deserve to be maintained. 
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Constructivists think that a proof of the existence of a mathematical object should 
tell you how to construct that object; this follows from their belief that other 
interpretations of the phrase “there exists,” in a mathematical context, are either 
incomprehensible, or can be formulated in more descriptive ways (principles B 
and D). 

These are controversial, even revolutionary, ideas. Constructivists want to make 
fundamental changes in the way we view mathematics; they want to change the 
rules by which the game of mathematics is played. But most people aren’t 
interested in changing the rules. For one reason, most people like the rules as they 
are. In fact, successful mathematicians have a vested interest in keeping the rules 
as they are. Why should a champion chess player be interested in changing the 
rules of chess? Another reason is that, after all, aren’t the present rules of 
mathematics correct? How can there be any serious alternatives? 

Constructive mathematics is not just an idea but a substantial body of results. In 
[1] Bishop developed much of analysis along constructive lines. This classic book 
has been revised and extended [3]. A corresponding constructive development of 
algebra was carried out in [9]. For expository articles on constructive mathematics 
see [4], [5], [8], [10], and [11]. 

Let’s look at another example: consider the decimal expansion 

qm = 3.1415926535897932384626433832795.... 
This decimal expansion suits our purposes because it is a familiar example of a 
computable infinite sequence, and very little is known about it. The theorem I 
want to consider says that 


Theorem 2. There is a digit that appears infinitely often in the decimal expansion of 7. 


How can we establish such a theorem in light of the fact that so little is known 
about the decimal expansion of 77? As you may have already realized, the proof 
that I have in mind gives you no idea as to which digit appears infinitely often. It is 
a genuine indirect proof, as opposed to the usually cited examples of proof by 
contradiction—proofs that ¥2 is not rational, or that the number of primes is not 
finite—where the very meaning of the theorem requires that a contradiction be 
derived. 

Let’s look at a proof of this theorem. 


Proof: Suppose each digit occurs only a finite number of times in the decimal 
expansion of 7: so 0 occurs n, times, 1 occurs n,, times, etc. Then compute 

Re ny eng ed. 
places in the decimal expansion of 7. But there are only n, +n, + ++: +7, digits 
available to fill up these places, which is absurd, so our original assumption that 
each digit occurred only finitely many times must be false. | 


What has been proved? Let P, denote the proposition that the digit 7 appears 
(only) a finite number of times in the decimal expansion of a. Then-we have 
shown that the proposition 

A = P, and P, and::: and P, 
leads to a contradiction. That is, we have proved the negation =A of the 
proposition A. What we wanted to show, on the other hand, was that — P, holds 
for some digit i, that is we wanted to prove the proposition 
B= -—P, or =P, or::: or 4 Po. 
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According to the usual laws of logic (DeMorgan’s law), B is the same as — A, 
which we have proved. The constructivist grants that — A has been proved, but 
denies that B has been proved. He wants to draw a distinction between B, which 
asserts the existence of a digit with the property — P, and — A, which merely says 
that it is impossible for all digits to have the property P (meaningful distinctions 
deserve to be maintained). So we see what the rules are that constructivists want 
changed: no less than the rules of logic. 

The controversy over nonconstructive techniques in mathematics goes back at 
least to the beginning of this century. David Hilbert and L.E.J. Brouwer were the 
principal participants in this controversy. Brouwer founded the philosophy of 
mathematics called intuitionism [6]. Most constructive mathematicians, of whatever 
school, consider Brouwer to be a spiritual ancestor. Hilbert was the foremost 
mathematician in Germany, and possibly in the world, in the early twentieth 
century. Many American mathematicians trace their mathematical lineage to 
Hilbert because of the German mathematicians who settled in the United States 
prior to World War II, and the Americans who went to Germany in the early part 
of the century to study with Hilbert and his students. 

Hilbert used nonconstructive techniques to solve a well-known problem con- 
cerning the construction of a finite set of polynomials with certain properties. The 
problem had been solved by P. Gordan, for the case of polynomials in two 
variables, by explicitly exhibiting the finite set of polynomials. Hilbert solved the 
general case, but his proof gave no clue how to construct the required polynomials. 
This appears to be the first use of a nonconstructive proof to establish the 
existence of mathematical objects that were expected to be constructed explicitly. 
The reaction of Gordan, perhaps in reference to proofs of the existence of God, 
was “That’s not mathematics, that’s theology.” 

Since then, Hilbert’s approach has so dominated mathematical thinking that 
alternatives are not considered seriously. But Hilbert and Brouwer had one thing 
in common: they both thought that nonconstructive techniques needed justifica- 
tion. Brouwer thought that the answer was to use only constructive techniques. 
Hilbert did not want to abandon his nonconstructive techniques; instead, he 
proposed to show that you couldn’t get into trouble using them. 

Hilbert’s program was to show that if a theorem proved by nonconstructive 
means predicted the result of a computation, then it would predict the correct 
result. Thus even if the smallest digit that occurred infinitely often in the decimal 
expansion of 7 were just a fiction, we could treat it as reality and still never say 
anything that was verifiably false. An analogy is the introduction of a square root 
of —1, which in some sense is simply a fiction, but it helps us to prove things 
about, and to discover properties of, the numbers that we actually believe in. 

Of course the proof that you don’t get into trouble using nonconstructive 
techniques must use only constructive techniques in order to be convincing. 
Hilbert’s program was utterly demolished by Kurt Godel, who showed in the 
thirties that not only couldn’t you prove such a thing constructively, but you 
couldn’t even prove it using only the nonconstructive techniques that you were 
attempting to justify. Remarkably, this had no effect on the acceptability of 
nonconstructive techniques! 

My final example of a nonconstructive existence proof comes from the theory of 
computable functions. That theory was developed to clarify the idea of what it 
means for a function to be computable. This is not an idea that would occur to a 
constructivist, for whom computability is part of the intuitive idea of a function: if 
there exists y such that f(x) = y, then we must be able to construct that y if we 
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are given x. A constructivist can certainly entertain the idea of looking at a 
restricted class of functions that are computed in a special way, but would be 
unlikely to call that class “the computable functions.” 

In the theory of computable functions, a function from the natural numbers 
0,1,2,... to the natural numbers is called computable if there exists a computer 
program to compute it. It is well known that, for all but the most anemic 
programming languages, this notion is independent of the particular language 
used. In the standard theory, nonconstructive proofs of the existence of a program 
are allowed. 

It will be convenient to abbreviate the following statement by P.. 


P,: There are (at least) n consecutive 4’s in the decimal expansion of 7 .. 


Consider the function f defined by setting f(n) = 1 if P, and f(n) = 0 otherwise. 
Clearly f(0) = f(1) = 1. I asked my computer to give me 100 places of 7, and I see 
that there are 4’s in places 59 and 60 (if I haven’t miscounted), so f(2) = 1. But 
what, for example, is f(12)? No one knows; no one even knows of a computation 
that would resolve this question. If you compute a billion places in the expansion 
of a you might discover that f(12) = 1, but no such computation would tell you 
that f(12) = 0. A constructivist would say that we have not defined f for all n 
because we have not shown how to compute f(7) in general. 

The orthodox view, however, is that not only have we defined f for all n, but 
that f is computable! To verify the first claim, consider the set 


G = {(n,i):i=1and P,, ori = 0and not P,}. 


This is the graph of f. To show that f is defined at m means to show that there 
exists 7 in the set {0,1} so that (n,i) is in G. It is clear that if (m, 1) is not in G, 
then (n, 0) is in G, because either statement is equivalent to denying P,. But this 
constitutes a nonconstructive proof that there exists i for which (n, i) is in G, as it 
is impossible that neither (n, 1) nor (n, 0) is in G. 

So suppose that f is defined for all n—the constructivist will have to imagine 
that he can consult an oracle to determine the value of f(n). Now we want to show 
that there is a program that computes f. This requires a second nonconstructive 
argument. The gimmick is that we do not have to produce the program; we merely 
have to show that f cannot be different from each computable function. 

Consider the following collection of functions, one for each natural number m: 


g,(n) = es ifn<m 
0, fhm, 
together with the function g,, which is identically equal to 1. Certainly g., is 
computable, and g,, is computable for each finite m: the program simply com- 
pares n with the fixed number m and returns 1 or 0 as appropriate. Suppose 
f(n) = 0 for some n. Let m be the first place where f(m) = 0. Then f =g,, 
because f(x) = f(y) whenever x < y. Thus if f # g,, for each finite m, then f(n) 
cannot be 0 for any n, so f =g,,. Therefore f cannot be different from each 
computable function. That’s as far as a constructivist can go, even with an oracle 
that tells him what f(7) is. 
The final (nonconstructive) step in the argument is to apply an infinite version 
of the DeMorgan’s law used in the second example. We have shown that the 
proposition 


f #g,. and f # g, and f #g, and-:- 
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is false. We conclude that the proposition 
f = 8 Or f= 8 or f= g, Or * 


is true—if a collection of propositions cannot all be false, then one of them is true. 
So there exists m such that f = g,,, whence f is computable. 

I will let Wittgenstein have the last word. Georg Cantor laid the foundation for 
the theory of transfinite sets upon which twentieth century mathematics is based. 
Hilbert [7], referring to objections of the intuitionists, said ““No one shall drive us 
out of the paradise which Cantor created for us.” Wittgenstein [13, p. 103] later 
wrote in response to this, 


I would say, “I wouldn’t dream of trying to drive anyone out of this paradise.” 
I would try to do something quite different: I would try to show you that it is 
not a paradise—so that you'll leave of your own accord. I would say, “‘ You’re 
welcome to this; just look about you.” 
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Introduction to Metric-Preserving Functions 


Paul Corazza 


1. INTRODUCTION. Under what conditions on a function f :[0,~) > [0,) is it 
the case that for each metric space CX, d), fd is still a metric, and, moreover, d 
and fod are equivalent metrics? 

It is well-known that for any metric d, d/(1 + d) is a (bounded) metric that is 
equivalent to d. On the other hand, d/(1 + d*) need not be a metric; see 
Proposition 2.7. We call f:[0,°) — [0,%) metric-preserving (respectively, strongly 
metric-preserving) if for all metric spaces (X, d), fod is a metric (respectively, is a 
metric that is topologically equivalent to d). 

Although the first reference in the literature to the notion of metric-preserving 
functions seems to be [19], the first detailed study of these functions was by 
Sreenivasan in 1947 [16]. Kelley’s classic text in general topology mentions some of 
Sreenivasan’s results in an exercise [14, p. 131]: 


Exercise (Kelley). Suppose f :[0,°%) — [0,) is continuous, nondecreasing, and 
subadditive (f(x + y) < f(x) + fly) for all x, y). BupD ON also that f~'(O) = {0}. 
Then f is strongly metric-preserving. 


In the past two decades, a significant literature has developed on the subject of 
metric-preserving functions. The purpose of this paper is to introduce some of the 
results and techniques of the field to a broader mathematical audience. 

We begin our study of metric-preserving functions in the next section, where we 
build tools to understand these functions and consider some revealing examples. 
Section 3 examines the important relationship between strongly metric-preserving 
functions and continuity. In the final section, we survey some of the results on 
differentiability in the context of metric-preserving functions. Many of the results 
discussed here have been garnered from papers that have appeared in other 
languages and in journals unfamiliar to the author; special thanks go to Jozef 
Dobos for his tremendous help in making so many of these papers available to me. 
Space constraints prevent me from discussing many avenues of research related to 
metric-preserving functions that have been pursued by various authors; see [8] for 
an excellent list of references. 

An interesting application of metric-preserving functions was discovered by Juza 
in 1956, long before the subject had matured [13]. It is now well-known that there 
are complete nowhere discrete metric spaces that have a nested sequence of closed 
balls with empty intersection (of course the diameters of such balls cannot tend to 
0). Jiza observed that the real line could be topologized to obtain such a space, 
using a metric-preserving function; in particular, he showed that (R, f° e) has the 
required property if e is the usual metric on R, and f is the metric-preserving 
function defined by 


xX ifx<2 


AC aka eee if x > 2. (1.1) 


A 
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We prove that f is metric-preserving in the next section; see [10] for technical 
refinements of this result. 


2. METRIC-PRESERVING FUNCTIONS. A metric space is a set X together 
with a function d: X x X — [0,) satisfying the following three conditions: 


(M1) For all x, y © X, d(x, y) = 0 if and only if x = y; 
(M2) For all x, y © X, d(x, y) = d(y, x); and 
(M3) For all x, y, z © X, d(x, y) + dy, z) => d(x, z). 


If a real-valued function f defined on a set S C R is metric-preserving, then S 
must include [0,), its range must lie in [0,), and f '(O) = {0}; we call such 
functions amenable. Since the values that an amenable function may have at 
negative reals have no bearing on whether the function is metric-preserving, we 
further require that all amenable functions have domain precisely [0, ©). 

The next proposition identifies a basic property of all metric-preserving func- 
tions: 


Proposition 2.1. If f is metric-preserving, then f is subadditive. 


Proof: Let a,b € [0,~) and let d be the usual metric on R. Then 
f(a) + f(b) = (fe d)(0,a) + (fed)(a,a + b) 
>(fed)(0,a+b)=f(atb). Fs 
Terpe used the subadditivity criterion to show that a fairly broad class of 
functions is not metric-preserving. Before stating his result, we recall that a 
function f :[0,) — [0,%) is convex on [0, c] if 
x+y) f(x) +f) x+y 
——— ———— So ZA 
f | 2 | ane: | 2 | 1) 
whenever 0 <x < y <z<c, where the graph of g is the line passing through 


(x, f(x)), Cy, fGy)). Moreover, f is strictly convex if (2.1) holds when < is replaced 
by <. 


Corollary 2.2 [18]. Given f :[0,%) — [0, ©), suppose that either 


(A) f is strictly convex on some interval including the origin and f(0) = 0, or 
(B) f is differentiable on (u,~) for some u > 0 and lim, .,,.f'(x) = +. 


Then f is not metric-preserving. 


Proof: For (A), let c be a positive number for which f is strictly convex on [0, c]. 
Then f(c/2) < f(c)/2, whence f(c/2) + f(c/2) < f(c), which violates subadditiv- 
ity. | 

For (B), assume f is differentiable on (u,™), lim, _,,,f’(x) = +, and f is 
metric-preserving. Let x) > u. Because f’ tends to +, there is an r > 0 such.that 
for all x > r, f'(x) > f(x) /Xq. Pick x, > r and use the Mean Value Theorem to 
obtain a y € (x,,x, + x,) such that f’(y) = (f(x, + x9) — f(x) /x. It follows 
that (f(x, + x9) — flx,))/x9 > f(%9)/xXp, which violates subadditivity. a 


Borsik and Dobos [1] give an example of a metric-preserving function f that is 


differentiable on (0,°%) and satisfies lim sup, _,,.f’(x) = +, showing that the 
condition “lim, _,,,f’(x) = +” in Corollary 2.2(B) is optimal. In the same paper, 
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the authors extend Corollary 2.2(A); we state their result in Theorem 3.5. The 
proof makes use of a symmetry between subadditive and convex amenable func- 
tions, which is developed in the following exercise: 


Exercise 1 [1]. 


(1) Suppose f :[0,~) — [0, °°) is subadditive. Show that for all positive integers 
n, f(nx) < nf(x) and f(x/2") = f(x)/2” whenever x > 0. 

(2) Suppose f is amenable and convex on [0,c]. Show that for all positive 
integers n, f(x/2") < f(x)/2” whenever 0 <x <c. 


While subadditivity is an important necessary condition, the function x/(1 + x”) 
shows that subadditivity is not sufficient for an amenable function to be metric- 
preserving. However, adding “nondecreasing” to subadditivity does yield a suffi- 
cient condition: 


Proposition 2.3. Suppose f is amenable, subadditive, and nondecreasing. Then f is 
metric-preserving. 


Proof: Let (X, d) be a metric space; we show f° d is a metric. Properties (M1) and 
(M2) are easy to check. For (M3), let x, y, z © X, and let a = d(x, y), b = dy, 2), 
and c = d(x, z). It suffices to show that f(a) + f(b) > f(c). But 


f(a) + f(b) =f(a+b) (subadditive) 
> f(c) (nondecreasing ) , 


as required. ra) 


Terpe [18] noticed that if g:[0,0) — [0,°) is non-increasing, then [jf g(t) dt is 
subadditive, and thus by Proposition 2.3, it is metric-preserving. 

Another application of Proposition 2.3 involves concave functions: A function 
f :[0, 0) — [0,%) is concave if for all x, y => 0, 


(=) : f(*) ed | 


Clearly, f is concave if and only if —f is convex on [0, c] for every c > 0. Ger and 
Kuczma [12] showed that concave amenable functions must be nondecreasing; 
since such functions are easily shown to be subadditive, we can use Proposition 2.3 
again to conclude that they are also metric-preserving. 


Exercise 2. Use either Proposition 2.3 or the fact that concave amenable functions 
are always metric-preserving to show that log,(1 + x), with a > 1, and x’, with 
0 <r<1, are metric-preserving. 


Another interesting example of a metric-preserving function was discovered by 
Dobo§ [7] who showed that the extended Cantor function (extended to have value 
1 for all x > 1) is subadditive, and hence, by Proposition 2.3, is metric-preserving. 

Our examples so far have been both nondecreasing and continuous. A simple 
example of a discontinuous nondecreasing metric-preserving function is 


f(xy = {9 if x =0 


otherwise. 
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Proposition 2.3 ensures that this function is metric-preserving. It is also possible to 
obtain a continuous, metric-preserving function that fails to be nondecreasing. In 
order to construct this and other related examples, we need the notion of a triangle 
triplet, which is used to characterize metric-preserving functions. This notion first 
appeared in Sreenivasan’s early paper [16], though the terminology itself did not 
appear in the literature until [2]. 


Definition 2.4. A triangle triplet is a triple (a, b, c) of nonnegative reals for which 
a<b+c,b<a+c,andc <a+b; equivalently, |a-—b|l <c<at+b. 


Triangle triplets are precisely those triples of nonnegative reals that are of the 
form (d(x, y), d(y, z), d(x, z)) for some metric space CX, d) and some x, y, z € X. 
This observation follows from Proposition 2.5 and the proof of Proposition 2.6. 


Proposition 2.5. If (X,d) is a metric space and x, y,z € X, then 
(d(x, y), dCy, z), d(x, z)) is a triangle triplet. 


Proof: This is immediate from the triangle inequality. 


Proposition 2.6 [2]. Suppose f is amenable. Then the following are equivalent: 


(1) fis metric-preserving; 
(2) for each triangle triplet (a, b, c), (f(a), f(b), f(c)) is a triangle triplet. 


Proof of (1) = (2). Given a triangle triplet (a, b,c), let d be the usual metric on 
R’. A straightforward argument using elementary geometry shows that there are 
u, V,w © R? such that d(u,v) = a, d(v,w) = b, and d(u,w) =c. The result now 
follows from Proposition 2.5. x 


Proof of (2) > (1). Given (X, d), we verify that fd is a metric. Properties (M1) 
and (M2) are immediate. For (M3), use (2) and the fact that (d(x, y), d(y, z), 
d(x, z)) is always a triangle triplet for x, y,z © X. is 


For metric-preserving functions f, we obtain from Proposition 2.6(2) the in- 
equality 


Ifa) — f(4)| s f(a — b]) (2.2) 


by letting c = |a — DI. 
Das [5] offers an alternative to Proposition 2.6(2) in his characterization of 
metric-preserving functions. A second alternative is the following: 


Exercise 3((16], [18]). Show that Proposition 2.6(2) can be replaced by 
for each triangle triplet (a,b,c), f(a) < f(b) +f(c). (2') 


The proof of Proposition 2.6 shows that an amenable function f is metric-pre- 
serving if and only if fod is a metric on R* whenever d is. DoboS gives an 
interesting example that shows R* cannot be replaced by R [6]. He builds a 
variation f of the extended Cantor function that preserves metrics on R and has 
the property that lim inf,, , f(x) = 0. The next proposition shows that such func- 
tions cannot be metric-preserving: 
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Proposition 2.7. Suppose f is metric-preserving. 


(1) For each x. > 0, there is an € > 0 such that f(x) = € for eachx >x,. In 
particular, lim inf, , 9 f(x) > 0, and (x, 0) is not a limit point of the graph of f 
for any x > 0. 

(2) If f is discontinuous at 0, there is some € > 0 such that f(x) > ¢€ for allx > 0. 


Proof of (1). If the assertion is false, there are x) > 0 and a sequence (x, >, such 
that x, >, for all n, and (f(x,)), — 0. Let n be such that f(x,) < f(x9)/2. 
Then (x,,, X,,, Xo) is a triangle triplet, but (f(~,,), f(x,,), f(%9)) is not. i 


Proof of (2). If (2) is false, it follows from (1) that there is a decreasing sequence 
(X,)n Such that (x,), 2 0 and <f(x,)>, — 0. By discontinuity at 0, there are 
€ > 0 and a sequence ‘jy, ), converging to 0 such that f(y,) => e for all n. Now let 
n be such that x, < €/2 and let m be such that y, <x,. Then (x,,x,, y,,) isa 
triangle triplet, but (f(x,), f(x,), fCy,,)) is not. ui 


Proposition 2.7 shows that metric-preserving functions cannot have the x-axis as 
a horizontal asymptote; thus, the function x/(1 + x”) is not metric-preserving. 

Using Proposition 2.6, we can now exhibit quite a variety of discontinuous 
metric-preserving functions. We call an amenable function f tightly bounded if 
there exists a v > 0 such that f(x) € [v, 2v] for all x > 0. 


Proposition 2.8 [2]. If f is amenable and tightly bounded, then f is metric-preserving. 


Proof: Let v > 0 be such that f(x) € [v,2v] for all x > 0, and let (a, b, c) be a 
triangle triplet. Since the cases in which abc = 0 are trivial, we assume abc > 0. 
Then f(a) < 2v=v+u<f(b)+f(c), and Exercise 3 gives the desired conclu- 
sion. iz 


Any amenable, tightly bounded function is necessarily discontinuous at 0. It 
follows that there are 2° tightly bounded, amenable functions (where c is the 
cardinality of R), so “most” metric-preserving functions are not continuous. 
A typical pathological example that one can construct with Proposition is the 
following: | 


Proposition 2.9. There exists a metric-preserving function that is nowhere continuous 
and nowhere of bounded variation. 


Proof: Let {A, B} be a partition. of (0,%) such that both A and B are dense in 
(0, ©). Define f:[0,) — [0, %) by 


0 ifx=0 
f(x)= 51 ifxEeA 
2 wxesB. 


Since f is amenable and tightly bounded, it is metric-preserving; because of the 
choice of A and B, f has the required pathologies. eB 


The example given in Proposition 2.9 shows, in particular, that metric-preserv- 


ing functions need not be nondecreasing. Pokorny [15] has isolated a fairly natural 
class of amenable functions for which all metric-preserving functions must be 
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nondecreasing: Define 
¥ = {g: for some periodic h:[0,~) > [0,~), g(x) =x + h(x) for all x > O}. 


An example of a member of % is x + |sin(x)|. Pokorny showed that for amenable 
members f of the class Y, f is metric-preserving if and only if f is nondecreasing 
and subadditive. 

Proposition 2.9 also shows that metric-preserving functions need not be of 
bounded variation on any interval. Nonetheless, most of our examples of metric- 
preserving functions have this property. Terpe [17] formulated a sufficient condi- 
tion, involving the notion of bounded gradient: Given r > 0, we say that a metric- 
preserving function f is of r-bounded gradient at 0 if there is some h > 0 such that 
f(x) < ~ for all x € [0, h]. And we say that f is of bounded gradient at 0 if there is 
some r > 0 such that f is of r-bounded gradient at 0. 


Proposition 2.10 [17]. Jf fis metric-preserving and of bounded gradient at 0, then f is 
of bounded variation on each closed interval lying in [0, ©). 


We postpone the proof of Proposition 2.10 until Section 4. There, we show that 
a metric-preserving function is of bounded gradient at 0 if and only if the 
derivative of f at O exists and is finite. We are establishing a global property of f 
(namely, bounded variation on each closed interval) based on the behavior of f at 
0. This theme reappears when we consider continuity in the next section. 

For the remainder of this section, we discuss techniques for building new 
metric-preserving functions from old and apply these to answer several natural 
questions: | 


Q1. Can a metric-preserving function be strictly decreasing on an interval 
(a,~), a > 0? 
Q2. Can a continuous metric-preserving function be strictly decreasing on an 
~ interval (a,), a > 0? 
Q3. Must every continuous, nondecreasing metric-preserving function be con- 
cave? 
Q4. Must every discontinuous metric-preserving function be tightly bounded? 


Q5. Must every metric-preserving function that is continuous at 0 be continu- 
ous? 


To answer Question Q1, we first show that every bounded function [0,~) > 
[0, 0) has an upward translation that is tightly bounded. For each f :[0, °°) — [0, ©) 
and each r > 0 we define 


0, if x = 0 
= ee) +r, ifx>0. 


Proposition 2.11. Suppose f :[0,%) — [0,%) is bounded above. Then there is an 
ry > 0 such that U;, , is metric-preserving for all r > ro. 


Proof: Let ry be an upper bound for f and note that U; , is tightly bounded for all 
r 2p. a 
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Example 2.12. A metric-preserving function that is strictly decreasing on (0, °). 


Define 
0 ifx=0 
B(x) = 144 if x > 0. 
x 
Now, g = U,, where 
0 ifx=0 
Ce if x > 0. 
x+1 


By Proposition 2.11, g is metric-preserving and is decreasing on (0,). This 
example answers Question Q1. 
The next proposition provides a tool for answering Questions Q2 and Q3. 


Proposition 2.13 [2, Proposition 2.16]. Suppose f is metric-preserving and suppose 
c,r > 0. Define T, . , :[0,%) — [0,%) by 


1X if x € [0,c) 


T — 
fre,r(*) f(x) otherwise. 


Then T, , , is metric-preserving if and only if f(c) = re and |f(x) — f(y)| < rlx — y| 
for allx, y € [c, ©). be 


In [11], the authors generalize this result, replacing the function rm in the 
defintion of T, ,, by a concave metric-preserving function g :[0,) — [0,%). Let 
T, .,- denote the function defined from f and g in this way. As the authors of [11] 
show, if lx — y| <c implies |f(x) — f(y)| < g(x — yl) for all x, y > c, then T, , , 


is metric-preserving. 


Example 2.14. A metric-preserving, continuous function that is strictly decreasing on 
(1,0). Let g be as in Example 2.12. Define 

Xx if x € [0,1] 

g(x) otherwise. 


T(x) = | 


Clearly, T is continuous and strictly decreasing on (1, ). Since T = T, , ;, Proposi- 
tion 2.13 ensures that T is metric-preserving. This example settles Question Q2. It 
also shows that continuous metric-preserving functions need not be nondecreasing. 


Example 2.15 [18]. A continuous, nondecreasing, metric-preserving function that is 
not concave. Define 


1 Lo ax <2 
f(x)= 5x-1 if2<x <3 
2 otherwise. 


Since f is tightly bounded, f is metric-preserving. Define 


x if x € [0,1] 
f(x) otherwise. 


T(x) = | 
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Clearly T is continuous and nondecreasing; T is not concave since 
a+b T(a)+T(b 
_ F(a) + Tb) 
2 pi 


when a = 1 and b = 3. Since T = T,; , ,, Proposition 2.13 can be applied to show 
T is metric-preserving. This example settles Question Q3. 


Exercise 4 [13]. Use Propositions 2.11 and Proposition 2.13 to show that Juza’s 
function (1.1) is metric-preserving. 


To answer Question Q4, we invoke one of the closure properties of the class of 
metric-preserving functions. We summarize these in the following theorem; we 
omit the straightforward proofs. 


Theorem 2.16 [1], [2], [17]. 


(1) If f, g are metric-preserving and m > 0, then each of fog, f+g,mf and 
max( f, g) is metric-preserving. 

(2) If (h,)n are metric-preserving functions that converge pointwise to a function h 
and h(x) >0 for all x, then h is metric-preserving. Likewise, if <> _,h,, 
converges to a function h, where each function h,, is metric-preserving, then h 
is metric-preserving. 

(3) If S is any set of metric preserving functions that is pointwise bounded and if we 
define g(x) = sup{f(x): f € S}, then g is metric-preserving. 


Example 5 [2]. A discontinuous and metric-preserving function that is not tightly 
bounded. Define 
0 if x = 0 
ia) f +|x—1| otherwise. 
The function f is discontinuous at 0 and not tightly bounded. Now, f = max(g, h), 
where g(x) =x and 


0 ifx=0 
h(x) = 1+|x-1| if x € (0,2) 
2, otherwise. 


Since h is metric-preserving (because it is tightly bounded), and g is metric-pre- 
serving, it follows from Theorem 2.16(1) that f is metric-preserving as well. This 
example settles Question Q4. 


We postpone a discussion of Question Q5 until the next section, where it is a 
central topic. 


3. STRONGLY METRIC-PRESERVING FUNCTIONS. In this section we charac- 
terize the metric-preserving functions that are strongly metric-preserving. An 
important theme here is the significance of the behavior of a metric-preserving 
function at 0: We show that such an f is strongly metric-preserving if and only if it 
is continuous at 0. , 

We begin with some notation and an observation. For a metric space CX, d), 
x €X, e> 0, write: 


N(d,x,€) ={y €X:d(x,y) < e}. 
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Lemma 3.1. Suppose f is metric-preserving. Then the following are equivalent: 


(1) fis discontinuous at 0; 
(2) fed is a discrete metric for every metric d. 


Proof of (1) = (2). Let (X, d) be a metric space. By Proposition 2.7, there is an 
€ > 0 such that f(z) > e for all z > 0. Then N(f od, x, €) = {x} for each x © X, 
as required. z 


Proof of (2) = (1). Let d be the usual metric on R and let € > 0 be such that 
N(f od, 0, €) = {0}. It follows that there is a sequence (x, ), of positive numbers 
converging to 0 (relative to d) such that e < f(d(x,,0)) = f(x,) for all n. This 
establishes (1). si 


The next theorem first appeared in [2]; one direction of the theorem was 
observed in [16]. 


Theorem 3.2 [2]. A metric-preserving function is strongly metric-preserving if and only 
if it is continuous at 0. 


Proof: One direction follows from Lemma 3.1. For the other direction, suppose f 
is continuous at 0 and metric-preserving. Let (X,d) be a metric space. We show 
that fod and d are equivalent metrics. Let e > 0 and x € X. By continuity, let 
6 < € besuch that f(z) < € whenever 0 < z < 6. Then N(d, x, 8) CN(f od, x, €). 
Hence, the d-topology refines the f° d-topology. To show the converse, again start 
with x © X and let r > 0. Use Proposition 2.7(1) to obtain an e€ > 0 such that 
f(z) = e€ for all z > r. But now N(fed,x,€) C N(d,x,r),and we are done. mm 


In [2], the authors develop Theorem 3.2 further by providing necessary and 
sufficient conditions on a metric-preserving function f for d and fed to be 
uniformly equivalent as well. 

As we now show, continuity at 0 forces a metric-preserving function to be 
continuous everywhere; this result answers Question Q5 from Section 2. 


Theorem 3.3 [2]. Suppose f is metric-preserving and continuous at 0. Then f is 
continuous on [0, ©). 


Proof: Assume that f is not continuous at some x, > 0. Let e > 0 be such that 
there are z arbitrarily close to x, for which |f(z) — f(x))| = e. By continuity at 0, 
let 6<x,)/2 be such that 0 <a < 6 implies f(a) < e. Now pick z, so that 
IZy —Xol < 6 and [f(zo) — f(xp)| = €. Let ay = |zy — Xo). 


Case 1. f(z,) = f(x) + €. 


If x9 + ad) = Zp, then f(x.) + flag) < flxo) + € < f(%_ + a), which violates 
subadditivity. 

On the other hand, if z) + ay) =X, notice that (xo, Zo, aq) is a triangle triplet, 
and in particular that x) — z) = a, < 6. But since f(z.) — f(xp) = € and flay) < 
€, (f(x), f(z), f(ao)) is not a triangle triplet, which violates Proposition 2.3. 
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Case 2. f(x,) = f(zo) + €. 


The proof in this case is similar to that in Case 1, and we omit it. 
In each case, we have obtained a contradiction from the assumption that x is a 
point of discontinuity, as required. s 


Thus, for a metric-preserving function f, the global properties of continuity and 
being strongly metric-preserving are completely determined by the behavior of f at 
0. And continuity of f at 0 is determined by a property that is apparently even 
weaker: It follows from Proposition 2.7(2) that f is continuous at 0 if and only if, 
for each e > 0, there is an x > 0 with f(x) < e. We have proved the following: 


Theorem 3.4 [2]. Suppose f is metric-preserving. Then the following are equivalent: 


(1) fis strongly metric-preserving; 

(2) fis continuous at 0; 

(3) fis continuous on [0, ©); 

(4) for each € > 0, there is anx > 0 with f(x) < e. eal 


As an application of Theorem 3.4, we give a proof of a result in [1] that 
improves upon Corollary 2.2(A): 


Theorem 3.5. If f is metric-preserving and is convex on [0, c] for some c > 0, then f is 
linear on (0, cl. 


Proof: Let f be metric-preserving and convex on [0,c]. We establish the conclu- 
sion of the theorem from the following three claims: 
Claim 1. For each x € [0, c] and each positive integer n, f(x/2") = f(x)/2". 
Claim 2. The function f is continuous. 
Claim 3. Whenever 0 <x <y<c, f(x)/x < fly). 

Using these claims, we prove the result by showing that f(x) = (f(c)/c)x for 


all x € [0,c]. Since this relation is obvious for x = 0, let x € (0,c], and let n be 
such that c/2” <x. Then by Claims 1 and 3, 


gn 


9 


as required. 

We turn to the proofs of the claims. Claim 1 is proved by combining parts (1) 
and (2) of Exercise 1. For Claim 2, we use Claim 1 and Theorem 3.4(4): For any 
e > 0, we obtain x for which f(x) < € by setting x = €/2”, where n is such that 
fle) /2”" < €. 

To prove Claim 3, let A = {u € [0,c]: f(x) /x < fly) /y whenever 0 <x <y< 
u}. Vacuously, A is nonempty. Let z = supA; we prove A = [0,c] by showing 
z = c. Seeking a contradiction, assume z < c. There are two cases to consider: 


Case 1. z ¢ A. From our assumptions, it follows that there is an x) < z such that 
f(x) /x > f(z)/z whenever x, <x <z. For each such x, let g,:[0,) — [0, ~) 
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denote the function whose graph is the line through (0,0) and (x, f(x)). By the 
choice of z, whenever x9 <x <y <z, we have g,(z) > f(z) and g,(y) = g,(y). 
By continuity of g,, there is a 6 such that 0<65<z-—vx, and g,(x) > f(z) 
whenever 0 <z—x < 6. Let x, =z — 6, and let € = g, (x,) — f(z). Then, when- 
ever 0 <z-x< 6, 


f(x) — f(z) = 8,(*) — f(z) 2 82,(*) — f(z) = 8,(41) — f(z) = €, 
which contradicts the continuity of f at z. 


Case 2. z © A. By our assumptions, we can find y such that f(z)/z > fly)/y 
where y > z and y is arbitrarily close to z. Pick such a y with y —z <z. Set 
x = 2z-—y. Then z = (x + y)/2, and by a straightforward computation, 


fee) =f), 0) =f) aa 


However, it is easy to verify that 


ie) 


- a a < HZ) = Fla) | (3.2) 


f(z) . 
< —— impli 
Z ZX 


and 


fo) <2, = 


impli 
y 


Combining the conclusions of (3.2) and (3.3), we have 


fz) - fF), 10) -f@ 


ZX y-z 


9 


which contradicts (3.1). This completes the proof of Claim 3 and the theorem. Mm 


Metric-preserving functions can be viewed as a tool for producing new metrics 
of the form fod on a given metric space CX, d). The results of this section show 
that the usefulness of this tool does not lie in generating topologically distinct 
metrics; indeed, at most two distinct metrics, up to topological equivalence, can be 
obtained in this way: the discrete metric and d itself. More useful is the fact that 
whenever X has cardinality at least 2, there are c distinct metrics on X of the 
form fod, where f is metric-preserving. This fact suggests that the class of 
metric-preserving functions may be a rich source for constructing metrics that are 
topologically equivalent to d but that exhibit new, mathematically interesting 
properties. Indeed, Jiza’s function (1.1) is just this kind of example. 


4. METRIC-PRESERVING FUNCTIONS AND DIFFERENTIABILITY. Mirror- 
ing the situation for continuity, the notion of differentiability partitions the class of 
metric-preserving functions into two rather different subclasses determined by the 
value of the derivative of each function at 0. We shall see that the (extended) 
derivative of such a function always exists at 0; the central question is whether the 
derivative is finite or infinite. Functions with finite derivative form a well-behaved 
class of continuous functions that are differentiable almost everywhere; functions 
with infinite derivative, by contrast, can be very unruly—they can be continuous, 
nowhere differentiable (in the finite sense), and even, as we saw in Section 2, 
nowhere continuous. In this section we outline proofs of these results, which we 
have extracted from [1]. | 
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We first show that for any metric-preserving function f, f’(0) exists in the 
extended sense. The proof naturally divides into two parts depending on whether 
the set K, = {r > 0: f(x) < m™ for all x > 0} is empty. In the course of the proof, 
we show that f’(0) < + if and only if K, # ©, and f’(0) = + if and only if 
K, =. The next lemma is the first step in proving the result for the latter case. 


Lemma 4.1 [1]. Jf f is metric-preserving and 0 <x < y, then f(y)/y < 2f(x)/x. 


Proof: Let n be a positive integer such that 


grtect <9" (4.1) 

x 
Then (x, x, y/2”') is a triangle triplet since y/2”~' < 2x. Thus, f(y)/2""! < 
f(y /2""') < 2f(x), where the first inequality follows from Exercise 1(1). Hence, 
using (4.1), f(y) < 2” '(2f(x)) < 2yf(x)/x, and the result follows. ce 


We can now prove that f’(0) exists and is infinite when K,; = 0: Let n be a 
positive integer. Since K,;= 0, we can pick y > 0 such that f(y) = 2ny. Let 
x € (0, y]. By Lemma 4.1, 2n < f(y)/y < 2f(x)/x. But now we have shown that 
for each integer n > 0, there is y > 0 such that f(x)/x >n whenever 0 <x <y, 
as required. 

We turn to the case in which K, # ©. Suppose f is metric-preserving and that 
there is an h > 0 such that f(x) < m~ for all x € [0, h] (that is, f is of r-bounded 
gradient at 0, as in Proposition 2.10). We show that in fact, f(x) < ~ for all x = 0. 
To see this, let x => O and let n be a large enough integer so that x/2n <h. By 
Exercise 1(1), f(x)/2" < f(x/2") < m/2", whence, f(x) < 1m. Combining this 
result with (2.2), we also have |f(x) — f(y)| <r|x — y| for all x, y > 0. Thus: 


Proposition 4.2 [1]. Suppose r>0 and f is metric-preserving and of r-bounded 
gradient at 0. Then, 


(1) f(x) < x for all x = 0; 
(2) f(x) — fly)| < rlx — y| for allx, y > 0.G 


The next result that we need is a generalization of Theorem 3.5: 


Lemma 4.3 [1]. Suppose f is metric-preserving and r > 0. If in every neighborhood of 
0 there is a point a such that f(a) = ra, then there is anh > 0 such that f(x) = rx for 
all x € [0, h]. 


Proof: The hypothesis and Theorem 3.4(4) imply that f is continuous. Let h > 0 
be such that f() = rh. Assume there is x € (0,h) with f(x) #x. Let A={ye€ 
[0, 00): f(y) = ry}. By continuity of f, A A [0, x] is compact and so has a maximum 
element, which we denote by my ,; likewise, A ™ [x, h] is compact, and we denote 
its minimum element by m,, ,. 

If f(x) > 7, we show there is a number u € (mo, ,, x) such that f(u) < mu, 
from which it follows, by continuity, that f(z) = rz for some z € [u, x), contradict- 
ing the choice of my ,. To obtain u, let y€ A with 0<y <x — my, ,. Now let 
u=y + my , and observe that 


fly + mp ,) < f(y) + f(mpo x) = 1) + 1M , =r(y + Mp ,)- 
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Similarly, if f(x) <r, one obtains a u € (x, m,,,,) such that f(u) = mu, yielding 
a contradiction, as before. Here, pick z€@A with 0<z<m,,-—x and set 
u=m,, —zZ. We leave to the reader the verification that u has the required 


property. i 


Now we prove that f’(0) exists and is finite in the case K, # . By Theorem 
3.4, f is continuous, whence K y is closed. Let rp) = minK,. We show 
f(x) 


r= lim ——. (4.2) 


Let e > 0. We claim that: 
for each h > 0, there is an x €(0, A] such that f(x) > (r, — €)x. (4.3) 


To see this, assume instead that there is an h > 0 such that for all x € [0, A], 

f(x) < r,x, where r, =r, — €. Thus, f is of r,-bounded gradient at 0, and so by 

Proposition 4.2(1), f(x) < r,x for all x > 0, which contradicts the choice of rp. 
Next, we show that 


there exists an h > 0 such that f(x) > (7) — €)x forall x €(0,h]. (4.4) 
Assume instead that 

for each h > 0, there exists an x € [0, A] such that f(x) < (7) — €)x. (4.5) 

Let h > 0. By (4.3), there is an x, € (0,/] such that f(x,) > (ry) — €)x,, and by 

(4.5), there is an x, € (0, /] such that f(x.) < (ry) — €)x,. By the continuity of f, 

there is an x, € (0,/] such that f(x,) = (7) — €)x3. By Lemma 4.3, f(x) = (1% - 
€)x holds on some neighborhood of 0, contradicting (4.3). This proves (4.4). 

Now since ry © Ky, we also have f(x) < (7) + €)x for each x > 0. Thus, 


combining these results, we obtain: for each e > 0, there exists an h > 0 such that 
ry — € <f(x)/x < ry + € whenever 0 < x < h; that is, (4.2) holds, as required. 


We have proved: 


Theorem 4.4 [1]. Let f be a metric-preserving function. Then f'(O) exists (in the 
extended sense). es 


We now show that a metric-preserving function f with finite derivative at 0 
must be differentiable almost everywhere. We begin with a key lemma: 


Lemma 4.5 [1]. Suppose f is metric-preserving and f'(0) < +. Then 


(1) f(x) < f’(©) for all x = 0; 
(2) If(x) —fQ)| < f’'O|x — yl forall x, y = 0. 


Proof: To prove (1), let e > 0. Since f’(0) < +, there is an h > 0 such that 


f(x) < (f'(0) + €)x (4.6) 
for all x € [0, h]; that is, f is of (f'(0) + €)-bounded gradient at 0. By Proposition 
4.2, (4.6) holds for all x > 0. Since € > 0 was arbitrary, the result follows. Part (2) 
can be proved by again applying (2.2). a 


The proof of Lemma 4.5(1) shows that if f is metric-preserving and f’(0) < +%, 


then f is of bounded gradient at 0. The converse is also true, and follows 
immediately from Theorem 4.4. Thus: 


1999] METRIC PRESERVING FUNCTIONS 321 


Corollary 4.6. Let f be a metric-preserving function. Then f'(0) < + if and only if f 
is of bounded gradient at 0. fa 


Our target theorem falls out directly from Lemma 4.5(2) and establishes 
Proposition 2.10: 


Theorem 4.7 [1]. Suppose f is metric-preserving and f'(0) < +. Then f is of 
bounded variation on each closed interval lying in [0,). Hence, f is differentiable 
almost everywhere. a 


Finally, we consider the subclass of metric-preserving functions f for which 
f'0) = +. The following example, due to Dobos and Piotrowski, is a slight 
modification of Van der Waerden’s continuous nowhere differentiable function [3]. 


Example 4.8 [11]. A metric-preserving function that is continuous and nowhere 
differentiable. Define h :[0, °°) — [0, ©) by 


Fs 

os 

IA 
em Nie 


h me 
i) 5 +|x -[x] -3 ifx > 5, 


where [a] denotes the integer part of a. It is easy to verify that h = T, 1, for some 
tightly bounded function g (recall Proposition 2.13), and so h is metric-preserving. 
Likewise, for each integer n > 0, h(2” -x) is metric-preserving, and using Theorem 
4.9(1), so is h(2"-x)/2". Theorem 4.10(2) ensures that 


2 A ee) 
LOS 2a 

n= 0 
is metric-preserving as well. The proof that f is continuous and nowhere differen- 
tiable is essentially the same as Van der Waerden’s. 


The extent to which the class of metric-preserving functions is structured 
around their behavior at 0 has been a striking theme in this brief survey. Whether 
a metric-preserving function f transforms each metric d to a discrete metric or to 
a metric that is topologically equivalent to d is determined by whether f is 
continuous at 0. And whether f has the property of being (finitely) differentiable 
almost everywhere but nowhere infinitely differentiable is determined by whether 
f'(O) is finite or infinite. Although the results presented here offer many possible 
directions for generalization and further study, one of the most compelling is this: 


Which behaviors of a metric-preserving function f at O0—such as continuity at 0 
or “f'(O) < +°’*—determine interesting global properties of f ? 
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Someone wrote a book called The Joy of Math. 
Maybe I'll write a book called the The Pathos of Math. 
For through the night I wander 

between intuition and calculation 

between examples and counter-examples 

between the problem itself and what it has led to. 
I find special cases with no determining vertices. 

I find special cases with only determining vertices. 
I weave in and out. 

I rock to and fro. 

I am the wanderer 

with a Iemma in every port. 


Contributed by Marion Cohen, Drexel University, Philadelphia, PA 
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Magic Dice 


Bernard D. Flury, Robert Irving, and M. N. Goria 


1. INTRODUCTION. A magician offers her audience a betting game with two 
dice. With X and Y denoting the numbers shown by the first and the second die, 
respectively, the magician wins one ruble for 12 of the 36 possible outcomes (x, y), 
and loses 1 ruble for another set of 12 outcomes, as indicated in Table 1. No 
payment is made in the remaining cases. This seems to be a fair game, but in fact 
the two dice are not independent, as seen from their joint distribution displayed in 
Table 1. Soon enough the audience realizes that every time a ruble is paid it is in 


TABLE 1. Joint Probabilities for six-sided magic dice. 


Columns correspond to values of X; rows to values of Y. Entries are probabilities in multiples of 1/36. 
The audience wins if (X,Y) = (1,3), (1,5), (2, 3), (2,5), G, 3), G, 6), (4, D, 4, 4), G, 2), G, 4), (6, 2), or 
(6,4). The magician wins for 12 other (arbitrarily chosen) outcomes. 


the magician’s favor. This happens despite the fact that two observers who tally the 
frequencies of X and Y, respectively, find that both dice show each side with 
exactly the required relative frequency of 1/6. 

Now as almost everybody. knows, marginal probabilities do not determine joint 
probabilities, and therefore the audience asks for a third observer. The magician 
agrees and lets someone tally the frequencies with which X + Y takes values 2, 
3,..., 12. Sure enough, these are found to be 1/36, 2/36, etc, just as expected if X 
and Y were independent fair dice. Hence a fourth observer is admitted who tallies 
X — Y, and finally a fifth observer who tallies X + 2Y. When asked to admit a 
sixth observer, however, the magician stops the game. 

As can be verified from Table 1, none of the five observers studying the 
distributions of X, Y, X + Y, X — Y, and X + 2Y is able to detect that something 
is wrong with the pair of dice. Indeed, all five observers find exactly what they 
expect if X and Y are independent regular dice, and the deviations of the joint 
probabilities from their fair value 1/36 remains undetected. 

The obvious question is, how many different observers can the magician admit 
without giving away the secret? To make this question clear, we formulate the 
rules of the game more precisely. Each time the magician is asked to admit 
another observer, she chooses a linear combination aX + bY (with real coefficients 
a and b) that is not proportional to any of the previously assigned linear 
combinations. She constructs her dice such that she can admit as many observers 
as possible, yet be able to offer unfair bets. The maximum number of observers 
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that can be admitted is called the magic number. We investigate magic numbers for 
k-sided dice, k > 2. For six-sided dice the magic number is 5, and Table 1 gives a 
particular example where five observers can be admitted. It is straightforward (but 
tedious) to verify in Table 1 that any further linear combination would indeed 
reveal that some joint probabilities are not 1/36. Showing that it is impossible to 
construct an unfair pair of dice that would admit more than five observers is 
somewhat more laborious; Theorem 6 gives us a final answer. 


2. DEFINITION OF MAGIC NUMBERS. For a fixed integer k > 2, let (X,Y) 
denote a bivariate random variable taking values in the set {1,2,...,k} x 
{1,2,..., k} with probabilities p;_,. 


Definition 1. A linear combination Z = aX + bY is proper if its distribution is 
unchanged by setting all p,; = 1/k’. 


For any linear combination Z = aX + bY we have 
Pr[Z =m] = y) Pij> 
G, PEeZ,(a, b) 
where 
Jn(4,b) ={(i,j) EN? 1 <isk,1<j<k, ait bj =m). 
Hence Z = aX + bY is proper exactly if, for all integers m, 
k*-Pr[Z =m] = #(4,(a, b)), 

where #(/) is the number of elements in the set 7 


Two linear combinations Z, and Z, are considered identical if Z, = cZ, for 
some c € R. The trivial linear combination 0-X + 0-Y is always excluded. 


Definition 2. For k € N, k > 2, let m(k) denote the maximum number of differ- 
ent linear combinations such that the following condition holds: It is possible to 
find joint probabilities p;,, where not all p;; are equal to 1 /k*, such that all m(k) 
linear combinations are proper. The number m(k) is called the magic number for 
k-sided dice. 


For k < 4, m(k) can be found by hand, but for larger k a more systematic 
approach is needed. 


3. COMPUTATION OF MAGIC DICE, AND PRELIMINARY RESULTS. For 
k => 2 there are only finitely many linear combinations that map the points 
(i,j) € {1,...,k} x {1,..., k} into fewer than k* different points on the real line 
(which, in turn, would determine all p, j)- For instance, for k = 3 the only linear 
combinations to be considered are X, Y, X+ Y, X-—Y, X+2Y, X — 2Y, 
2X + Y, and 2X — Y. Only linear combinations with integer coefficients need to 
be considered, and a linear combination Z = aX + bY can therefore be repre- 
sented as a pair of integers (a, b). For each k > 2 the set of linear combinations to 
be considered is as follows. 


Definition 3. The set of feasible linear combinations, or feasible set, for k—sided 
dice is the set..A of pairs of integers (a, b) such that 
Gi) 0O<a<k-1, -(kK-D<b<k-1. 
(ii) ab = 0 implies a = 1 or b = 1. 
(iii) If both a and b are nonzero, then a and b are relatively prime. 
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Consider the matrix U = [u,,], where u,, = k*p,;. For a given (a,b) © A, the 
linear combination aX + bY generates a number JA,(a, b) of linear equations in 
the variables u,;. For example, for k = 3 and (a, b) = (1, 1), the five equations 


Uy, = 1 

Uy, + Uy, = 2 

Uy, + Uy + uz, = 3 

Ux, + Uz = 2 

U33 = 1 
are generated. In the N,(a,b) equations generated by (a,b) © A each of the 
variables u;; occurs exactly once, and the right-hand side of each equation is 
the number of variables on the left hand side. Using the vec—operator that stacks 


the columns of a matrix on top of each other, and writing 1, for the m—vector with 
1 in each position, the equations generated by (a, b) € A can be written as 


C(a, b)vec(U’) = C(a, b) 1,2. 
For k = 3 and (a, b) = (1, 1), we obtain 


1000000 0 0 
01010000 0 
cii,1l)=|0 0 10101 0 0 
00000101 0 
000000001 


The matrix C(a, b) is binary, of dimension N,(a, b) x k?, and has full row rank. 
For a subset Y= {(a,, b,),...,(a,, b,)} CA, let 


C(a,, 5,) 
Cy= 
C(a,; by) 
We refer to 
C,vec(U’) = Cy 1,2 (1) 


as the system of equations generated by .Y. Defining V = U — 1,1, we can write (1) 
as 


C.vec(V’) = 0. (2) 


For a given subset .Y C %, we have to solve (2). If rank(C ,) = k*, then vec(V’) = 0 
is the only solution, so U = 1,1, and p,; = 1/k?* for all (, j). If rank(C,) < k’, 
then multiple solutions exist, and ‘since vec(V’) = 0 is always a solution, we can 
then obtain another solution for which the p, ; are probabilities. The magic number 
m(k) is the largest cardinality of all subsets 7 C.A% such that rank(C,) < k?. 

In preliminary calculations for k up to 24, Gaussian elimination was used to 
determine the rank of of C, exactly. In all cases considered, whenever Y was a 
maximum cardinality subset it was possible to generate integer—valued solutions 
for U, as in Table 1. (This is always possible, as we will see at the end of Section 5). 
The computational problem was huge for the larger values of k, and it was 
necessary to program the Gaussian elimination algorithm in integer arithmetic to 
retain full precision. Results obtained in this way are summarized in Table 2. Note 
that the choice of a new linear combination to be included is not always unique. 
For example, when going from k = 5 to k = 6, we have a choice of including one 
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TABLE 2. Magic Numbers and Related Quantities for k—sided dice, 2 < k < 24. 


S = largest subset of F% such that rank[C,] < k?; C, = coefficient matrix generated by .%; m(k) = 
magic number for k-sided dice. In the column labelled .Y, each row shows only the linear combination 
introduced in addition to the ones already present in the preceding rows. Note: Magic dice for k = 23 
and k = 24 can also be constructed using the linear combinations (1, 4) and (4, 1) instead of (2, 3) and 
(3, 2). 


rank(C ,) 


(1, 0) (0, 1) 
1, 1) 
(1, -1) 


(1, 2) 
(2, 1) 


(1, —2) 
(2, -1) 


(1, 3) 
(3, 1) 


(1,3) 
(3, -1) 


(2, 3) 


(3, 2) 


ee ee ee ee 
FPRRWWNONNFPCCO TWO WA OANA AANA NA HL HW WN 


of the four linear combinations (1, 2), (2, 1), (1, —2), and (2, —1). For all values of 
k up to 24 it was possible to find k-sided magic dice with a set of linear 
combinations that contains the set of linear combinations used for k — 1 as a 
subset. We return to this point in Section 4. Table 3 shows joint probabilities for 
an example of 14-sided magic dice, admitting ten proper linear combinations. 


TABLE 3. Joint Probabilities for 14—sided magic dice. 
Columns correspond to values of X; rows to values of Y. Entries are probabilities in multiples of 
1/196. The set Y of proper linear combinations has elements (1, 0), (0, 1), @, 1), (1, — 1), G, 2), (2, 1), 
(1, —2), (2, —1), (1,3), and @, 1). 


SBR RR Be RP RP PN OR Re Re 
Se RR Be OR RP RP RP RP Nee 
BPrRrPNrFP NR RP RP OR FOR 
BP rROrRrRONONN RFF eR 
= Se Re Re RR OR FP RB ON Re bh 
PNONFRFPNFRFP FP ORF OS 
SBrmeme ON CORR RP RN RRB 
mame ROR RB RP RS ON OK 
CrerP Orr rRP NFR RK NY ON 
NOR N ORR FO RR RR Re 
PrRPrRPNNON COR FOR BR 
BSB OrRrPOrRrFP FP NRF NR Fe 
PRN RP Re RP RP RP OR Pe ee 
BR RP RP ON Be RP ee Re 


8 
7 
6 
5 
4 
3 
2 
1 
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Our first Lemma proves two intuitively reasonable aspects of magic numbers. 


Lemma 1. For the magic numbers m(k) the following holds: 


(i) m(k + 1) = mk). 
Gi) lim, .m(k) = @. 


Proof: Part (i) follows by taking a matrix U of dimension k x k that generates 
k-sided magic dice, and appending a column of 1’s and a row of 1’s. For part (ii), 
we prove a stronger result that implies (ii), namely: for any set of linear combina- 
tions Y = {(a,, b,),...,(a,, b,)} with integer coefficients there exists a k © N such 
that rank[C ,] < Ke. For any two positive integers a and b, the linear combination 
aX + bY applied to k-sided dice can take integer values from a + b to k(a + b). 
Thus such a linear combination generates at most (k — 1)(a + b) + 1 equations. 
Similarly for arbitrary integers a and b, at most (k — 1)(|a| + |b|) + 1 equations 
are generated. Let N, = la,| + |b,| G@ =1,..., p), and N= UP?_,N,. Then, for 
k-sided dice, the set ~ generates at most 


¥ [(e- N41] =(k-)N 4p 
i=1 


equations. Choosing k such that k? > (k — 1)N + p:p gives a coefficient matrix with 
k? columns and fewer than k* rows, which cannot have full column rank. fel 


4. FINDING MAGIC NUMBERS. We now describe a simple way to compute 
magic numbers. Throughout this section we refer to the first and k-th rows and 
columns of the matrix U as the margins of U. 


Lemma 2. For fixed k and for any YC &, the following two conditions are equiva- 
lent: 


(a) rankK(C ,) = k’. 
(b) At least one of the margins is uniquely determined. 


Proof: Clearly (a) implies (b). To show the reverse, let u; be the j-th column of U, 
and suppose (b) holds for the first column. Define a k Xk matrix U_, = 
(u,,...,u,,1,), and notice that Y generates the system of equations 
Cyvec(U_,) = Cy1,2. (3) 
To see why this is true, represent the k* variables as grid points with coordinates 
(i, j),1 <i,j < k, in the plane. (This is similar to the approach to be used later in 
the proof of Lemma 3). Each of the equations generated by .Y may be represented 
_ by a straight line that hits one or several points that correspond to variables. 
Formally, we may associate grid points outside the square with variables whose 
value is set to be 1. Then (1) holds as well if we shift the square by one unit to the 
right, which implies (3). By assumption, the first column vector of U_, must be 1,. 
By induction all columns of U must be equal to 1,. ri 


Lemma 2 shows that it is important to understand the conditions under which 
the margins are determined. In fact, as soon as at least one linear combination 
(a, b) with ab # 0 enters the equation system, some entries in the corners of U are 
determined (must take the value 1). In the next lemma, which is central to the 
theory, we describe the way in which the corners of U “fill up” as the number of 
linear combinations increases. 
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ae 3. Let Y* bea set of r = 1 linear combinations (a,,b,) such that both 
>1 and b, =1. Leta*=%j_,a, and b*= Xi), _,b,. If U solves the system of 
Aare (1) generated by Y*, then 


(i) uy, = 1 forj =1,...,a", andu,, =1 fori =1,...,b". 
(ii) If ris even, then 4 ata. = Ug+a.1- Pris odd, then Uy gta, + Uye 41 = 2. 
(iii) The u,; for j > a™, and the uj, for i > b*, are not determined. 


Proof: Let #* = {(a,, b,),...,(a,, b,)}. In the real plane, consider the grid given 
by all points with positive integer coefficients, and identify the grid point (, j) with 
the corresponding variable u;;. For the current purposes we may think of the 
number of grid points as unlimited. The linear combinations in .~* determine 1 to 
be the value of some of the variables associated with grid points near the origin. 
Let m, =a,/b, for h =1,...,k, and assume without loss of generality that 
m,>m,> - >m,. Forh =0,...,r, let 


a,= ) a; and B, = Db, 


i=h+1 
define 7 + 1 points 
P,=(£B, + 1, a, + 1), h=0,...,7, 


and call them the characteristic points of /*. In particular, we have P) = (1, a*+ 1) 
and P. = (b*+ 1,1). Conversely, let P, = (x,, y,), h = 0,...,7r, denote r + 1 grid 
points in N?* such that x, =1, y,=1, and the sequence m, =(y,_, —y,)/ 
(x, — x, _,) is monotonically decreasing. Then the P, determine an associated set 
S* with elements (a,,5,) to (a,,b,) uniquely by P, — P,_, = (—),,a,). Thus 
there is a one—to—one relationship between sets .7* and their characteristic 
points. : 

For a set “* with characteristic points P),...,P,. let J, be the line segment 
connecting points P,_, and P,, and denote by & the polygon obtained by joining 
the r line segments. We now show that all variables associated with grid points 
inside the area bounded by Y and the coordinate axes are determined (= 1) by 
the equations generated by “*. This implies part we of Lemma 3. See Figure 1 for 
a graphical illustration. 


No WoO FP UAW WA ~s Ww w 


— 


mn) 


QO: 2. 2 3 2h 3. 36 4 BD AO 


Figure 1. Illustration of the Proof of Lemma 3. Grid points whose associated variables are determined 
by the linear combinations in the set .7* = {(2, 1), (1, 1), (2,3), (1, 2)} are marked by plus—signs. The 
characteristic points of .~* are shown as large dots, and connected by the polygon 7. 
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We show first that the above statement implies part (ii) of Lemma 3. Denote the 
variables associated with the characteristic points Py,..., P. by &),..., €&. The line 
segment /, corresponds to an equation &,_, + &, = 2 because all other grid points 
in N? hit by the line of which J, is a segment are determined. Thus we have an 
equation system | 


été, =2 
é,+€,=2 
SPs = 2 


Successive elimination of intermediate variables gives €) + € = 2 if r is odd, and 
é) = &. if r is even. This proves part (ii) of Lemma 3. Note also that if we set any 
of the €, equal to 1, then by the above equation system all other &, are 
automatically 1 as well. 

We now prove that all variables in the area below & are determined. For each 
real s with 1<s<a‘+1 let “s) be the polygon obtained by translating 7 
down to start at (1, s) instead of at (1,a*+ 1). Let 1 =s, < +: <s, =at+1be 
the sequence of distinct values of s such that (s) passes through a grid point in 
the area bounded by and the axes. Then, by induction on i for i = 1 toi =m, 
all variables are determined in the interior of the area bounded by “%s,) and the 
axes. In passing from i to i + 1 it has to be shown that each grid point P on “s,) 
is determined. If P is the only grid point on a segment of “s,), this is obvious. If 
P and Q are two grid points on the same segment of “s,), they must be at the 
ends of the segment, s; must be an integer, and there is a chain of grid points 
along “(s;), one at each corner. The lowest of this chain of grid points is 
determined, hence so are all the others by working along the chain. 

To prove part (iii), we proceed by induction on r. Assume that (iii) hold true for 
some r > 1, and notice that no variable associated with a grid point (i, j), where 
j >a, is determined. Adding a new linear combination (a,,,,b,,,) to the 
previous set .“”* of r linear combinations, we get a new set 4%, =7* 
U{(a,,1, 5,..,)}, whose initial characteristic point is ,a*+a,,, + 1). The only 
way the variable associated with this characteristic point could be determined is 
that the variable associated with the point (b,,, + 1,a*+ 1) is determined, which 
is not the case. Similarly, the variable associated with the final characteristic point 
of .%',, is not determined. i 


By Lemma 3, the system of equations generated by .“”* imposes exactly 
a*+b* linearly independent constraints on the variables associated with points 
on the left and bottom margins: the first a* points are determined vertically, the 
first b* points in the margin are determined horizontally, which means that 
a*+b*—1 points are determined around the corner, plus the additional con- 
straint of part (i). For fixed k, consider similarly the three remaining corners of 
the matrix U. Exactly the same result as Lemma 3 is established for the corner with 
coordinates (k,k). For the two remaining corners, consider a set Y of linear 
combinations (a,,b,) where a, > 0 and b, <0, and put a = Ya,, b> = X|b, |, 
both sums extending over all (a,,b,) € Y . The same arguments as in Lemma 3 
show that the first a~ points are determined vertically, and the first b~ points are 
determined horizontally. Finally, let ° be a set of linear combinations where 
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either a, = 0 or b, = 0; there are at most two elements in ., each imposing a 
constraint on the rows and columns, respectively. Put a° = La, and b° = Yb,, 
both sums extending over all elements in .“°. Both a° and b° can only take values 
0 and 1. 

Finally, for a given set “YC Y,, we can write .~Y as the union of three disjoint 
sets Y°, 4*, and ~, as above. Let 


a*=a°+at+a = ya, 
(a,,bJESP 
and 
b*=b°+bt+b = YY |b, |. 
(a,,b5,E€F% 


By the previous considerations, exactly a* constraints exist on the first column of 
U, and b* constraints on the first row of U. By Lemma 2, all variables in U are 
determined exactly if a* >k or b* >k. If max{a*, b*} <k, then U is not com- 
pletely determined. Thus we have the following main result. 


Theorem 4. The magic number m(k) is the number of elements in a largest set 
SF CF, such that 


yy a,<k and De (Op esk: rs 
(a,,b5,)ESF (a,,b,EF% 


For example, the set consisting of linear combinations (1, 0), (0,1), (1, + 1), 
CL 2), (2, DD), Ch 3), Gye 1), 23), Crk 2) 4) 4, ee sand (3,4), 
has 21 elements, with La, = 40, ©|b,| = 41. This is a largest set such that 
max{La,, X&|b,|} < 42, and therefore m(42) = 21. The set is not unique because 
the linear combination (3, 4) may be replaced by any of (3, —4), (4,3), or (4, —3). 
Let .Y* be the set obtained by taking (3,4) out of Y and adding the two linear 
combinations (1,5) and (5, 1); then ~* has 22 elements, with La, = X|b,| = 43. 
This is a largest set such that max{La,,|b,|} < 44, and therefore m(44) = 22. 
This shows that the linear combinations contained in magic k-sided dice are not 
necessarily all contained in magic (k + 1)-sided dice. See the discussion following 
the proof of Theorem 6. 

We now show how the magic number m(k) can be expressed in terms of the 
classical Euler totient (or phi) function. For a positive integer n, $(n) is the 
number of positive integers less than n that are relatively prime to n. For 
convenience, we define (1) to be 1. Also define 


O(n) = Edi) 
and 


V(n) = Lidl), 


i=1 


and put ¥ = U,. >A, where F, is the feasible set for kK—-sided dice. For a subset 
SY of F, let og = max(La, © |b|), where the sums are taken over all pairs (a, b) € Y, 
and call .Y k-bounded if o,<k. It is convenient to view ¥ as an ordered 
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sequence of pairs in which (a, b) precedes (p, q) if 
(i) a + |b| <p + lq|, or 
(Gi) a + |b] =p + |g| and |b] — a]>|Iq! — pl. 


Further, if 0 < a < b <k, with a and b relatively prime, then the pairs (a, —b), 
(b, —a), (a, b), and (b, a) appear in that order in the sequence. The first few pairs 
in ¥, represented as an ordered sequence, are (0,1), (1,0), 1, —1), (, 1), G, —2), 
(2, -1), 1,2), 2,1), G, -—3), GB, -—D), G,3), B,D, Gd, -4, 4, -1), G,4), (4, 2, 
(2,-— 3), G,— 2), (2, 3), G,2), A, -5), 6, —D,.... 

For r > 0, define 


= a= Del 


where the sums are takén over the first 27 pairs in the ordered sequence ¥. Note 
that o,, = o, when .~ is the set of the first 2r elements in the ordered sequence. 
Verification of the following lemma is straightforward. 


Lemma 5. If ®(n) <r < ®(n + 1), then 


(i) a4, = Vn) + [r — ®(n)(n + 1). 
(ii) Any 2r-subset Y of F must have o,= 0>,; and, forr>1, any (Qr+ 
1)-subset Y of F must have og = o>, + (n + 3)/2]I. a 


Trivially, m(2) = 2 and m(3) = 3. For k > 3, Lemma 5 enables us to establish 
the value of m(k). 


Theorem 6. For k > 3, suppose V(n) < k < V(n + 1), and let p =k — V(n) — 1. 
(i) If p mod (n + 1) < [(n + 3)/2], then 
m(k) = 2®(n) + 2| p/(n + I)], 


and a maximum cardinality k—bounded subset of FY can be obtained by taking 
the first m(k) pairs in the ordered sequence. 
(ii) If p mod (n + 1) = |(1 + 3)/2\, then 


m(k) =2®(n) + 2|[ p/(n + 1)| +1, | 
and a maximum cardinality k-—bounded subset of ¥ can be obtained by 
taking the first m(k) — 1 pairs in the ordered sequence together with the pair 
(l(m + 3)/2],l(n + 1)/2). 
Proof: (i) If r= ®(n) + |[p/(m + 1], then by Lemma 5(), 
oy, = ¥(n) + (n+ Dip/(n+ DI <¥(n) +p=k-1, 


so we have a k—bounded set of the claimed size. On the other hand, by Lemma 
5(ii), any subset Y of ¥ of size > 2r+ 1 has 


Oy = Oy, + |(n + 3)/2] 
=W(n) + (n+ 1)[p/(n+ 1] + (n+ 3)/2] 
> W(n) +p — [(n + 3)/2| + [(n + 3)72| 
=k —1, 
and so it cannot be k—bounded. 
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Gi) If r= O(n) +|p/(n + 1), then, if Y is the set described, Lemma 5(i) 
gives 


Oy = Oy, + [(n + 3)/2| 
= WV(n) + (n+ 1)|p/(m + 1)| + [(% + 3)72| 
< WV(n) +p -— |(n + 3)/2| + [(n + 3)72| 
=k —1, 


so again we have a k—-bounded set of the claimed size. On the other hand, it is 
easy to verify that 


05.39 205, PN A; 
so that, by Lemma 5(ii), any subset Y of ¥ of size > 2r+ 2 has 
Op = O2742 
>W(n)+(n+1)[p/(n+1)] +n4+1 
> WV(n) +p—(pmod(n+1))+n+1 
Pk = 1, 


and so cannot be k—bounded. | 


As seen from Theorem 6, a set of k-sided magic dice can not always be 
constructed such as to contain the same linear combinations as magic (k — 1)—sided 
dice. Theorem 6 allows us also to establish the asymptotic behavior of the function 
m(k), based on the growth rate of the function ®. First we need another 
preliminary lemma. 

Lemma 7. For the functions ®(n) and V(n), the following holds: 

@ ®(n) = 3n*/7* + O(nlog n). 

Gi) V(n) = 2n3/r? + O(n’ log n). 


Proof: For part (i), see [3, pp. 448-449]. For part (ii), by summation of parts, 
W(n) = n®(n) — L772) @(i). Thus 


3 3 n-i 
V(n) = jn? + O(n’ logn) — — ¥ |i? + O(ilogi)], 
ow er | 
and the result follows. a 


Theorem 8. The asymptotic behavior of m(k) is 
m(k) = ck??? + O(k'logk), where c = 6/(27)”> ~ 1.7621. 
Proof: Given k, choose n such that 
W(n)<k<WV(n+1), 
and hence 
20(n) = m(V(n) + 1) < mk) <m(V(n + 1) +1) = 20(n + 1). 


From Lemma 7(ii) we have 


2 
k = —n’ + O(n’logn), 
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and from Lemma 7(i), 
6 
m(k) = —n’* + O(nlog n). 
17 
It follows that 
n= Gf) Re + O(log n) 
and so 
6 2 9\1/3 1173 ; 
m(k) = —|(02/2)'k' + O(log n)]| + O(nlog n). 
17 
Using n = O(k'”") leads to the stated result. qi 
Figure 2 shows a graph of exact values of m(k) along with the approximation 
nik) = 6k? /(2Q7)*/?, for 2< k < 100. As can be seen from the graph, the 


approximation is excellent. 
Before you ask, the magic number for 1998-sided dice is 280. 


40 
35 
30 
25 


magic number m(k) 
ho 
) 


0 10 20 30 40 50 60 70 80 90 100 
k = number of sides 


Figure 2. Exact values of the magic function m(k) (dots) and approximation m(k) = 6k?/? /Qa)*? 
(solid line). 


5. THE RANK OF THE COEFFICIENT MATRIX. The theory of Section 4 allows 
us to find the magic number m(k) as well as an associated set “ CF without 
solving the equation system of Section 3. For generating actual magic dice we still 
have to solve the equation system. As a byproduct of the Gauss—Jordan algorithm 
we obtain the rank of the coefficient matrix C,. In the current section we give a 
simplified method for computing the rank of C, that does not require solving the 
equation system. | 

The idea is to study how the rank increases for a given set .” of linear 
combinations if we go from (k — 1)-sided dice to k-sided dice. Let U be. the 
(k — 1) x (k — 1) matrix of variables for (k — 1)—-sided dice, and write A (instead 
of C, as before) for the coefficient matrix generated by the linear combinations in 
S. Going from k — 1 to k, we add a row and a column to the matrix U, thus 
introducing 2k — 1 new variables to get a new matrix U* of dimension k X k. 
Instead of writing the equation system for k-sided dice in terms of vec(U*’) as 
before, consider the variables contained in U as the (k — 1)? first ones, and the 
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2k — 1 new variables as the last ones. The coefficient matrix for k—-sided dice can 
then be written in partitioned form as 

, _|A B 

ar {6 2]. 

If A has s rows, then B has dimension s x (2k — 1). That is, the entries of B are 

coefficients of the newly introduced variables, but only for equations that involve 

entries from U. The matrix C contains the coefficients of all equations that involve 

only newly introduced variables; it has 2k — 1 columns and Xa, + |b,|) rows. We 
will now show how r(k — 1; .Y) = rank(A) and r(k; .Y) = rank(A*) are related. 

Perform Gauss—Jordan type row operations on the matrix A*, based on the first 

(k — 1)” columns, until as many rows as possible have zeros in the first (kK — 1)? 

entries. That is, transform A* into 


A, B, 
O B,|, 
Oo Cc 


where A, has full row rank r(k — 1; ). Because A, has full row rank, we get 
B 
rank(A*) = rank(A,) + rank vat 


Thus the rank of the coefficient matrix increases by 6 = rank x when going from 


(k — 1)-sided dice to k-sided dice. But 6 is the number of linearly independent 
constraints imposed on the newly introduced variables, i.e., the number of con- 
straints imposed by .~ on the variables in the first row and first column, say, of the 
matrix U*. _ 

In the notation and terminology of Section 4, consider the lower left corner of a 
k Xk grid, and assume max{Xa,, X&|b,|} < k, where both sums extend over all 
elements in .~ By Lemma 3, the variables associated with a*+ b*— 1 grid points 
around the corner (1,1) are determined. In addition, there is a linear constraint 
between one of the variables in the first column and one of the variables in the 
first row, making the total number of constraints due to .Y* equal to a*+ b*. The 
linear combinations in “ determine another a” variables vertically and b™ 
variables horizontally, and the linear combinations in add a° and b® con- 
straints vertically and horizontally, respectively. Thus the total number of con- 
straints is L(a, + |b,|), where the sum extends over all elements in .~Y. The rank 
increase is therefore 6 = L(a, + |b,|), independent of k, as long as k is large 
enough. 

If either La, > k or X|b,| = k, then Lemma 2 ensures that rank(A) = (k — 1)” 
and rank(A*) = k’; that is, the rank increase is 6 = 2k — 1. Finally, notice that for 
k = 1 the rank of the coefficient matrix is always 1, regardless of how many linear 
combinations are in ”. 

This is summarized in the following theorem. 


Theorem 9. Let r(k; ) denote the rank of the coefficient matrix of the equation 
system generated by the linear combinations (a,, b,) in the set *, for k—-sided dice. 
Then, for k = 2, 

r(k-—1;.7)+2U(a, +|b,|) if max{La,,L|b,|} <k, 


r(k; Y) = 
k? otherwise. 
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For a given set & of linear combinations, Theorem 9 provides a simple 
recursion to compute the rank of the coefficient matrix. In particular, if .Y is a 
largest set of proper linear combinations, then r(k; Y) = (k — 1)? + X(a, + |b, |), 
and the number of free variables in the equations system is 2k — (a, + |b,|) — 1. 
For instance, in Section 4 we gave a set of linear combinations for 42-sided 
magic dice and found m(42) = 21. Theorem 9 gives r(42; .Y) = 1762 = 427 — 2. 
For all sets of 1998-sided magic dice, the rank is 3992003 = 1998? — 1. 

Finally, we return to a question raised in Section 3: Is it always possible to find 
an integer—valued solution to the system of equations generated by ~? The 
answer is yes, by the following argument. If the coefficient matrix has rank r, then 
the values of f = k* —r variables can be chosen. Identify a free variable associ- 
ated with a characteristic point. Set the remaining (if any) f — 1 free variables 
equal to 1, and assign the value 0 or 2 to the chosen free variable. Then, working 
along chains like the polygon in Figure 1, variables are assigned values of 2 and 0 
in an alternating pattern. 


6. EPILOGUE. “Magic Dice” originated in the first author’s attempts to teach 
undergraduate students that marginal distributions do not determine joint distribu- 
tions. It is natural to ask how much more information needs to be given (in 
addition to the marginal distributions) such that a joint distribution is completely 
determined. Although the story about the magician is flawed (in the sense that it 
appears physically impossible to construct such dice), students can relate to it and 
find it entertaining. In fact, it would take a TRUE magician to make the numbers 
shown by two dice depend on each other! This distinguishes our magic dice (or 
“pseudo dice,” as suggested by a reviewer), from tricks with dice as described in 
[2]. The term “magic” may also be justified by the similarities with magic squares 
[6]. 

The idea of defining a joint distribution of two or several random variables by 
conditions imposed on linear combinations of the variables is not new; for 
example, a common definition of the multivariate normal distribution states that X 
is multivariate normal if every linear combination of X is univariate normal or 
constant. Melnick and Tenenbein give a nice example to illustrate that for any 
finite k, normality of k linear combinations does not imply multivariate normality 
[4]. For the discrete case, Rényi shows that a distribution of n strictly positive 
masses located at n distinct points in the plane, where both the sizes and locations 
of the masses are unknown, is determined by any n + 1 distinct projections, but 
not necessarily by n distinct projections [5]. Bélisle et al. study related questions 
for non-—discrete probability distributions [1]. 

Finally, it is not difficult to see that the same results hold as well for arbitrary 
distributions on a k X k grid, as long all k? probabilities are strictly positive. Up to 
Theorem 4, results also generalize easily to the situation of distributions on a 
rectangular grid of size k, X k,. It is natural to ask related questions in higher 
dimension, but to our knowledge no results have appeared in the literature. 
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It’s a kind of transitive law, isn’t it? 

when, in a house of growing children 

two people who pet the same cat are petting each other. 
Especially if one of them is holding the cat. 

Especially if both of them are holding the cat. 

And if Devin gets under the blanket with Mirage 

and lets only their heads stick out 

and smiles up in that way 

if the pug of his nose is close to that spot between Mirage’s ears 
and if I grab hold of it all 

and kiss it all... 


well, Devin also knows 

and Mirage also knows 

that something is necessary 
something is sufficient 

and something else is scared. 


Contributed by Marion Cohen, Drexel University, Philadelphia, PA 
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The Set of Differences of a Given Set 


Andrew Granville and Friedrich Roesler 


1. INTRODUCTION. A central problem of combinatorial geometry and additive 
number theory is to understand the set of sums or differences of a given set of 
vectors. For example, given a set of m arbitrary vectors A, how big is the set 
A+A:={a+b: a,b € A}, or the set A —A := {a — b: a,b € A}? By packing 
the vectors close together on a lattice one can make these sets small: for example, 
if A = {a,2a,3a,...,(m — 1)a, ma} then A +A and A —A both have 2m — 1 
elements. On the other hand, if the elements of A are appropriately spread out 
then we can make these sets large: for example, if A = {2',27,...,2”} then 
A+ A has (m? + m)/2 elements, and A —A has m? — m + 1 elements. 

It may be that the sizes of A — A and A +A are quite different: For example 
if A is the set of positive integers smaller than 10* that have only digits 1, 2, and 4 
in their decimal expansions then A has 3“ elements and A + A has 6* elements, 
far smaller than A — A, which has 7* elements. Similarly, one can construct A so 
that A+ A is far larger than A — A, for example by taking A = L*7}b,100! 
where each 5, is allowed to take any value from the set {0, 2, 3, 4, 7, 11, 12, 14}; see 
[7] and [8] for more details. For the more natural example A = {17,2’,...,n7} it 
can be shown, though with some difficulty, that A —A is about log*n times as 
large as A +A, for some constant x > 0. 

Some time ago it was realized that if either A + A or A — A is very small then 
A must have some special structure: Indeed, Freiman [5] (but see [9]) showed that 
A must then be a subset of a small part of a lattice. There have recently been 
several striking and elegant advances in this area of combinatorial additive number 
theory; see [1], [2], and [6]. Moreover Gowers has recently given a spectacular 
application of Freiman’s theorem, which proves the first reasonable upper bounds 
in Szemerédi’s theorem: Given a positive integer k and a 6 > 0, every subset of 
{1,2,...,m} with at least 6n elements contains a k-term arithmetic progression, 
provided n > N(k, 6). Gowers gave the upper bound [13] 


c.2k 
N(k,8)<2?"" 


for some constant c, a substantial improvement over bounds known previously. 
Seemingly unrelated to all this is: 


Graham’s conjecture. For any set A of m distinct positive integers, we have 


a 


TT > : 
ee gcd(a,b) — Us 


Equality holds only in the following cases: 


¢ A = {2, 3,4, 6}. 

© A={k,2k,..., mk} for some integer k. 

© A= {l/1,1/2,..., |/m} for some integer | divisible by the least common multi- 
ple of 1,..., m. 
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This old chestnut has recently been proved correct in an outstandingly original, 
though long and complicated, paper by Balasubramanian and Soundararajan [3]. 
Their method involves very careful consideration of prime divisors of linear 
combinations of numbers from A. In fact Graham’s conjecture had been elegantly 
proved for sufficiently large m by Szegedy [10] and Zaharescu [12] a decade 
earlier, but it took a wealth of new ideas to extend their result to all integers m. 

In search of a more combinatorial proof, one might approach Graham’s conjec- 
ture by asking whether there are at least m distinct integers in the set {a /gcd(a, b): 
a,b € A}? If so, Graham’s conjecture would follow easily. Unfortunately the 
answer is “no,” since for 


A = {2,3,4,6,9, 12, 18} (1 
one gets the set {1, 2, 3, 4, 6, 9}. 


Unsolved problem. For each integer m = 1, what is the least number of integers one 
can have in the set {a/gcd(a, b): a, b € A}, where A is a set of m distinct positive 
integers? 


One can relate this problem quite closely to the vector questions asked at the 
beginning of the Introduction: 

Let P1,P2.,-.-,p, be the set of primes that divide integers in A. Write each 
a € A inthe form a = p7'p$? --- p2 and associate to it the vector a = (a,,...,4,); 
note that distinct integers are associated with different vectors. Now, given 
a,b € A, evidently min{a,, b;} is the ith component of the vector associated with 
gcd(a, b). Thus a; — min{a;, b;}} = max{0,a; — b,} is the ith component of the 


vector associated with a/gcd(a, b); we call this vector 6(a, b). Thus we have: 


Unsolved problem (restated). For each integer m > 1, what is the least number of 
vectors one can have in the set 5(A) := {8(a,b): a,b € A}, where A is a set of m 
distinct vectors? : 


Remark. We can claim that this is a restatement of the first unsolved problem only 
if we state that the vectors in A all have non-negative integer entries. However, 
through a few minor technical tricks one can drop that requirement; we leave this 
as a challenge to the reader. 

To get the lower bound |6(A)| > m’'/* in the unsolved problem, we first note 
that, for fixed a € A, the pairs (6(a, b), 5(b, a)) must all be distinct since b = a — 
d(a,b) + 5(b, a), and so there are no less than m distinct pairs. Thus there are 
either > m'/* distinct values for {6(a, b): b € A} or for {5(b, a): b € A}, else there 
would be less than m distinct pairs (6(a, b), 5(b, a)), giving a contradiction. 

One can get a better lower bound for 5( A) if A is a set of vectors in the plane: 


Theorem 1. Jf ACR? isa set of m > 1 distinct vectors then 6(A) has at least 
(m/2)°/? vectors. In fact there exists a € A such that there are at least (m/2)*/”° 
distinct vectors amongst {85(b, a):a € A}. 


Perhaps such a lower bound holds in higher dimension. We postpone the proof 
of this and other results until subsequent sections. 

The example given in (1) is a translation of the set A = {(x, y) € Z*:0 <x, y 
<2,1<x+y < 3}, given by Freiman and Lev (taking n = 2, p; = 2, p, = 3). 
They generalized this to A = {(x, y) € Z*: x,y >0,L <x +y < U} with L = 
(2m)?/? — (2m)'77)/2 + O() and U = (2m)? + (m)'”)/2 + O11). Then 
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68(A) ={(4,y) EZ? :x,y>Oxn+y<U-L} vu ((t,0), 0,1) € Z’:0 <t < U}, 
which has ~ (3/2)(2m)’/? elements. Thus the lower bound in Theorem 1 is best 
possible up to a factor of 3-2'/°. Moreover from these remarks, combined with 
those directly above the statement of Theorem 1, we obtain the following partial 
result concerning our unsolved problem: 


Theorem 2. If A is any set of m > 1 distinct vectors with |6(A)| minimal, then 
(3 /2)2m)*/? > |8CA)| > m'”. 


The function a/gcd(a, b) in the unsolved problem is not symmetric in a and b. 
It thus might seem more natural to study the number of distinct values in 
{ab /gcd(a, b)” : a, b € A}. In this case we can prove a best possible result: 


Theorem 3. For any set of natural numbers A, there are at least |A| natural numbers 
in the set {ab/gcd(a, b)* : a, b € A}. | 


Remark. We show in Section 3 that the proof of Theorem 4 (which implies 
Theorem 3) can be modified to prove that equality holds only for the following sets 
A: Let q,,q2,.--,q, be positive rational numbers, with each g, =1r,/s; # 1 and 
gcd(r,, s,) = 1 such that gcd(7,s,,7,5,) = 1 if i # j. Let S be a subgroup of (Z/2Z)*. 


i9 i laa ge 
Then A is the set of integers bq;'q, -** qj* where the 1, satisfy /; <i; < u,, for 


some lower and upper bounds /; and u,, with (i,,i,,...,i,) € S, and b chosen SO 
that these numbers are indeed all integers. | 
A simple example is when A = {d:d|n} for any positive integer n. A more 
exotic example is A = {md?:dln,m = 1 or b} where squarefree b > 1 divides n. 
We may rewrite the question in Theorem 3 as a problem about sets of vectors: 


The ith component of the vector, d(a, b), associated with ab /gcd(a, b)? is 

| a; + b, — 2min{a;, b;} =|a; — 5,]. 

Therefore 
d(a,b) = 6(a,b) + 5(b,a) = (a; — b;|, [a — by|,---, |a, — 8,I) 


since a — b = 6(a,b) — 5(b, a). Thus Theorem 3 is equivalent to the following 
result (which we shall prove in the next section): 


Theorem 4. [fA is a finite set of distinct vectors in R” then D(A) = {d(a, b):a,b € 
A} has at least as many distinct vectors as A. 


These vector questions may all be thought of as problems about the set of 
distinct differences {A(a,b):a,b €.A} for some naturally defined “difference” 
function A between two vectors. Moreover, each of our questions has a number 
theoretic interpretation; the set of values {a — b} corresponds simply to looking at 
all ratios a/b of the corresponding integers. It is perhaps of interest to consider 
other difference functions that relate elementary number theory to vector prob- 
lems, though we have been unable to identify any others that are particularly 
appealing: 

Perhaps the most obvious difference function between two vectors is the 
Euclidean distance, (\a, — b,|* + |a, — b,\? + --- + la, — 6,|7)'”?. Unfortunately 
there is no straightforward number theoretic interpretation for the associated 
problem about integers. Moreover any set of orthonormal vectors has the property 
that the set of distances between pairs of vectors is {0,1}. However, Erdds [4] 
restricted his attention to a set A of m distinct points in the plane. He noted that 
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for the points in the k-by-k integer lattice, where k = Vm + O(1), we have 
#{|a — b|:a,b € A} ~ cm/ ylogm for some constant c > 0, and asked whether 
this is best possible, up to the value of c? The best result in this direction, due to 
Székely [11], is that there exists some b € A such that #{|a — b]:a € A} > c'm*”, 
for some constant c’ > 0. 


2. PROOFS OF THE THEOREMS. Although we found several proofs of Theo- 
rem 1, we decided to present here the following elegant proof communicated to us 
by Sudakov: 


Proof of Theorem 1 (Sudakov). Let x, be the smallest element of X = 
{x :(x, y) € A}, the x-coordinates of points in A, and let Y = {y:(x, y) € 4}, the 
y-coordinates of points in A. Then 6(A) contains points with x-coordinate x — x, 
for each x € X, so |6(A)| > |X| (and therefore we take a to be any element of A 
with x-coordinate equal to x,) . Similarly |6(A)| = |Y|. Thus our result is proved 
true unless |X|, |Y| < (m/2)’”, which we now assume. 

We define a series of sets A, = A DA, D -:: DA,, and then let L, be the set 
of lines of A,, that is, the sets of points {(x, y) € A,} for each x © X, = {x:(x, y) 
€ A,}, and the sets of points {(x, y) € A,} for each y € Y, = {y:(x, y) € A;}. The 
average number of points on each line in L, is r,;: = 2|A,|/(X;| + lY,|). Suppose 
there is a line in L, that has less than r,/2 points; we obtain the set A,,, by 
removing the points of that line from the set A;. Notice that r,;<7r,,,. We 
continue with this process until we reach the set A,, in which every line in L, has 
at least r,/2 >7r,/2 =m/(|X| + |Y|) = (m/2)’” points. 

Let x, be the smallest element of X,. Let y, be the smallest element of 
Y,) = {y:(x, y) € A,}, a set that is the same size as the line {(x), y): y € Yo} of 
A, and hence has size at least (m/2)'/°. Let B C A, CA be the union, over each 
y € Y,, of the lines {(x, y) € A,} of L,. Each of these lines contains at least 
(m/2)'/* elements, so that B has at least (m/2)?/? elements. Now 5(b, (xy, ¥o)) = 
b — (x, Yo) for each b € B, so that these 6 values are distinct. Therefore 
|5(4)| > |8(B)| = |BI (and a = (xp, yo)). a 


Proof of theorem 4 We use induction on n and then on the size of the set A. If A 
has just one element then D(A) contains only the zero vector, and so has exactly 
as many distinct vectors as A. If n =1 and A = {a, <a, <a,< + <a,} then 
{0, a. — a,, 4, — a,,...,4,, — a,} C D(A). This subset of D(A) has exactly as 
many distinct vectors as A, so |D(A)| = |A|. 

We may now assume that |A| > 1 and n > 1. Define 


B= {(x,...,X,-1): there exists x, with (x,,...,%,-1,X,) <A}, 

the projection of A onto the first n — 1 dimensions. Since B is a finite, non-empty 

set of distinct vectors in R"~', we can invoke the induction hypothesis to obtain 
| D(B)| =| BI. (2) 

Now, for each b € B, let 

A, = {x, :(b, x,) <A} and let a, = max{x:x € A,}, 

so A, is the set of numbers that give the mth coordinate of a vector in A when 

appended to the n — 1 coordinates of b, and a, is the largest such number. 

Finally, let C = A — {(b, a,):b © B}. That is, we get C by removing exactly one 

vector from A for each b € B, namely the vector (b,a,); in other words by 

“skimming off the highest point which projects onto b, for each b € B.” Therefore 


|C| =|A] —|B]. (3) 
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Since |C| < |A| we deduce, from the induction hypothesis, that 
|D(C)| =|C]. (4) 
We may describe D(A) and D(C) in terms of the elements of D(B) and ot 
elements of the sets A,: 
D(A) = Upe pa {(D, |a — a'|) : d(b, b’) = D with a € A, and a € A,}. 
Similarly, since C,, = A, — {a,}, we obtain 
D(C) = Upe pa {(D; |e — c'|) : d(b, b’) = D with c € C, and c’ € Cy}. 
Now comes the key observation in our argument: For any pair b,b’ € B, the 
largest difference |a — a’| with a € A, and a’ € A,,, must have a = a, or a’ = ay. 
Thus this largest difference does not appear among the set of differences 
{lc —c'|:c © C,,c’ © C,}. We deduce that, for any D € D(B), the set 
{|c — c'|: d(b,b’) = D with c € C, and c' € C,,} 
does not contain the largest element of 
{ja — a’'|: d(b,b') = D with a € A, and a’ € Ay}, 
so it is a proper subset, and is thus smaller. Comparing the sizes of D(A) and 
D(C), and taking this observation into account, we obtain 
ID(C)|< \ (#{la —a'|:d(b,b’) = D with a € A, and a’ € Ay} — 1) 
DeED(B) 
<|D(A)| —|D(B)|. (5) 
Combining (2), (3), (4), and (5) gives 
|D(A)| =|D(B)| +|D(C)| =| B] +1C] =[Al, 
as required. 


3. WHEN DOES EQUALITY HOLD IN THEOREM 4? As we indicated at the 
beginning of the Introduction, equality typically holds in inequalities such as 
Theorem 4, only when the vectors in A form part of a lattice. Let 1, be the set of 
all subsets J of Z* of the form RM A, where R is some rectangular box with sides 
parallel to the axes, and A is a lattice with (2Z)* c A C Z*. More specifically the 
points (i,,i,,...,i,) © S, with each i, contained in some interval [J;, u;], and S a 
subgroup of (Z / 20). One can easily verify that equality holds in Theorem 4 for 
any A € I,; in fact equality holds if and only if A is a suitable translation of some 
rel,: 


Proposition 1. If A C R” with |DCA)| = |A| then there exist a,v,,Vv>,...,V, € R”, 
wie the gih component of v,; non-zero for at most one j for each g, such that 
= {a+ Li_,iy;: ii, Jg5. 6 iD € I}, where I € 1,. 


Before we indicate how to deduce this from our proof of Theorem 4, we first 
note the following results: 


Proposition 2. [fA and B are two sets of distinct real numbers then #{\a — b|:a © 
A, b € B} = min{|A|,| Bl}. If equality holds then either 


¢ A = Bis an arithmetic progression; or 
={a+ Qi+ 1)d:1<2i+1<2N-— 1} with A, ={a+2id:0 <2i< 
2.N}, and then A is either Ay, or Ay less any one element; or 
¢ A ={m—a,m +a} and B = {m — b,m + b} for some a > b > 0. 
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The inequality in Proposition 2 can be proved by taking c to be the smallest 
number from either set (say from B), and then by noting that the numbers 
a—c,a €A, are distinct, positive real numbers. That the enumerated cases are 
the only ones in which equality holds may be proved by induction on min{|A|,| Bl}, 
using the induction hypothesis on the smaller sets created by removing the largest 
number from each of A and B. 


Proposition 3. Suppose that I <1, and that f: I — R is a map for which \f@ — 
f(j)| is a function of d(i,j) only. Then there exist constants a and B such that, for 
every i EI, either fli) = a + Bi; for some fixed j, or fi) = a or B depending on 
whether or not i € T, where T is a subgroup of S of index 2. 


Proposition 3 is easily proved by induction on k. 


Sketch of the proof of proposition 1. If |A| = 1 then clearly |DCA)| = 1. If n = 1 
then A is an arithmetic progression, by Proposition 2. We now proceed with the 
same induction as in the proof of Theorem 4, and then by induction on h(= h(A)), 
the maximum of |A,|,b € B. For h = 1, we know that B has the structure stated 
in Proposition 1 by induction, since |D(B)| = |B| as in the proof of Theorem 4. 
Define f(i) = a, where b = a + L*_,i;v,. The result follows from Proposition 3. 

Now suppose h > 1 and write A“ =A. By the proof of Theorem 4 we know 
that |D(B™)| = |B™| and |DCA™)| = |A™| where B® = B and A® = C. We 
now apply the proof of Theorem 4 to A® and then to A®, etc., to find a sequence 
of sects AY 5A? D5 +» > AM with each h(AM) =h + 1—j and |D(A”)| = 
|AD)|, Note that BY = {b © B: |A,| = jh. 

We first prove Proposition 1 for A* = {(b,x):b € B™, x € A,}. From the 
proof of Theorem 4 (taking D = 0) we see that there are no more than h elements 
in the union of all sets {|a — a’'|: a,a’ € A,}. If |A,| = then, by Proposition 2, 
A, must be an arithmetic progression; and the arithmetic progressions for any two 
such sets must have the same common difference. Moreover if |A,| = |A,| = 
then #{|a — a’|:a € A,, a’ € Ay} <h by the proof of Theorem 4, and so either 
A, =A, or they are two disjoint, but interwoven, arithmetic progressions, by 
Proposition 2. Thus the sets A, of size h are either all identical (in which case A* 
satisfies Proposition 1 by the induction hypothesis), or there are two possible such 
sets A,. In this case we may write each b=a+t L* j-14;V; by the induction 
By poieds and let f(i) be the smallest element in A,. By Proposition 3 we deduce 
that Proposition 1 holds for A*. 

The result thus holds when A* = A, that is, when there are h elements in every 
A,, or equivalently when B® = B™ If not then there exists b € BY \ BY for 
some j,1 <j <h-—1. Let b’ be any point in BY*” and D:=d(b’,b). The 
induction hypothesis ensures that each B is a lattice as described in the 
hypothesis of Proposition 1, and this particular lattice structure implies that there 
do not exist 6, B’ € BY*” with d(B, B’) = D. Therefore #{\a — a'|:a € A,, a’ 
€ Ay} = |A,|, by the proof of Theorem 4. Using Proposition 2 we deduce that A, 
and A,, are interwoven disjoint arithmetic progressions, whose union is also an 
arithmetic progression. Thus h(A,) =j + 1, so taking b’ €e B™ we see that 
j =h —1, and moreover all such sets A,, b’ € B™ must be the identical arith- 
metic progression. But the same argument applies to every b € B“’~”, and so each 
such A, is the same arithmetic progression interwoven between the elements in 
each A,, with b’ € B™. Thus Proposition 1 holds for A. 
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4. FURTHER QUESTIONS. Proposition 2 inspires, and provides the answer in 
one dimension to, the following open problem: If A and 8B are finite sets of 
distinct vectors in R” then show that the order of the set D(A, B) = {d(a,b):a € 
A,b € B} is at least min{|A|,| B|}. This translates to finding a lower bound for the 
order of {ab /gcd(a, b)*: a € A, b € B} where A and B are sets of distinct positive 
integers. 

Probably even more difficult would be to find a good lower bound for the order 
of {a/gcd(a, b), b/gcd(a, b): a € A, b € B}. Generalizing Graham's Conjecture, 
we conjecture that the largest element of this set is > min{|A|,| B|} (the authors 
of [3] have informed us that they retract the claim at the end of the introduction to 
[3], which would have implied this conjecture). 


ACKNOWLEDGMENT. We thank Seva Lev and Carl Pomerance for their helpful comments, and 
Sudakov for communicating his proof of Theorem 1. 


REFERENCES 


1. N. Alon, M. Nathanson, and I. Ruzsa, Adding distinct congruence classes modulo a prime, 
J. Number Theory 102 (1995) 250-255. 
2. Y. Bilu, Addition of sets of integers of positive density, J. Number Theory 64 (1997) 233-275. 
3. R. Balasubramanian and K. Soundararajan, On a conjecture of R. L. Graham, Acta Arith. 75 
(1996) 1-38. 
4. P. Erdés, On sets of distances of n points, Amer. Math. Monthly 53 (1946) 248-250. 
5. G. Freiman, Foundations of a structural theory of set addition, Transl. Math. Monographs 37 
(1973). 
6. V. Lev, Structure theorem for multiple addition and the Frobenius problem, J. Number Theory 58 
(1996) 79-88. 
7. I. Z. Ruzsa, Sets of sums and differences, Sém. Théorie Nombres Paris (1984) 267-273. 
8. I. Z. Ruzsa, On the number of sums and differences, Acta Math. Hung 59 (1992) 439-447. 
9. I. Z. Ruzsa, Generalized arithmetical progressions and sumsets, Acta Math. Hungar. 65 (1994) 
379-388. 
10. M. Szegedy, The solution of Graham’s greatest common divisor problem, Combinatorica 6 (1986) 
67-71. | 
11. L.A. Székely, Crossing numbers and hard Erdds problems in discrete geometry, Combin. Probab. 
Comput. 6 (1997) 353-358. 
12. A. Zaharescu, On a conjecture of Graham, J. Number Theory 27 (1987) 33-40. 
13. W. T. Gowers, A new proof of Szemerédi’s theorem for arithmetic progressions of length four, 
Geom. Funct. Anal. 8 (1998) 529-551. 


ANDREW GRANVILLE is the Barrow Professor of Mathematics at the University of Georgia. He is a 
regular contributor to the MONTHLY, and was a recipient of the MAA’s Hasse prize in 1995. His main 
research is in number theory and related topics. 

University of Georgia, Athens, Georgia 30602-7403 

andrew @math.uga.edu | 


FRIEDRICH ROESLER Harvard class of ‘68, was born in Saxony at the end of the second world war. 
He studied in Miinster (Germany), Cambridge, and London and received his Ph.D. from the University 
of Miinster. Since 1983 he has been a professor of mathematics at the Technical University of Munich. 
His current research interests lie in number theory and algebra, 

Zentrum Mathematik der Technischen Universitat Miinchen, Arcisstrasse 21, 80290 Miinchen, Germany 
roesler@mathematik.tu-muenchen.de 


344 THE SET OF DIFFERENCES OF A GIVEN SET [April 


Six Ways of Looking at Burtin’s Lemma 


S. Anoulova, J. Bennies, J. Lenhard, D. Metzler, 
Y. Sung, and A. Weber 


In an article in this MONTHLY in 1953 Metropolis and Ulam asked for the expected 
number of components of the graph induced by a purely random mapping of a set 
of 1 points into itself [7]. This problem was solved one year later by L. Kruskal [6]. 
In 1955, L. Kac [4] computed the probability that this random graph is connected, 
that is, that the number of components is 1. In 1981, S. Ross [9] treated the same 
questions for more general random mappings in which the function values are 
independent and identically distributed but not necessarily uniform. The funda- 
mental lemma of [9] had been proved earlier by the young Russian mathematician 
Y. D. Burtin, several months before his death in 1977 [2, Prop. 1]: 


Burtin’s lemma: Let F(1),..., F(m) be independent and identically distributed 
with P{F(i) = j} =p;, j =0,...,n. Generate a random directed graph [ with 
vertices {0,..., nm} by drawing an edge from i to F(i) for each i = 1,...,n. Then 
this graph is connected with probability pp. 

The proofs of Burtin’s lemma given in [2] and [9] both use induction on n, 
lumping together all those vertices directly connected to 0. In 1984, still another 
inductive proof was given by Jaworski [3]. The six ways of looking at Burtin’s 
lemma presented here arose from the attempt of each of us to understand it 
better. 

Burtin’s lemma has interesting connections with the random generation of 
spanning trees. Assign to each pair (i, j) 1 <i <n, 0 <j <n) the weight p,, and 
consider the problem of generating a random spanning tree on {0,..., } with root 
0 such that the probability of a tree is proportional to the product of all its edge 
weights. Propp and Wilson [8] describe how to do this in a straightforward way: 
Start at an arbitrary element i of {1,..., 7}, and consider the iterates F(i), F7(i),... 
until the first time they hit 0. Delete the cycles of the path from 7 to 0 and take 
this as the trunk of the tree. Then take an element j not on the trunk (if there are 
any), proceed in the way described until you hit the trunk, delete the cycles, and so 
on. Burtin’s lemma then tells us that the probability of never producing a cycle in 
all these attempts is just py. In fact, our Proof 4 computes this probability directly. 
Proof 5 provides a more economical algorithm for generating a random spamming 
tree with the desired distribution. 

Propp and Wilson [8] also treat the more general scenario in which the weights 
of an edge (i, j) can be of the form p;,; rather than p; only. They also. relate the 
problem of generation of random spanning trees to that of exact simulation of the 
equilibrium of a Markov chain by their algorithm of “coupling from the past.” In 
[8, Sec. 1.3], they give a short history of random spanning tree generation, 
including references both to the algebraic method (which relies on variants of the 
so called matrix tree theorem [1, Ch. 2, Thm. 8] and the method using Markov 
chains. 
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And now to the proofs. Let A (for “acyclic’”) be the event that I contains no 
cycle. One easily sees that A can also be described as the event that 


1) T is a (directed) tree; or 

2) T is connected (as an undirected graph); or 

3) there is a directed path from each vertex i =1,...n to 0. We then say 
“Rach i = 1,...n chooses, directly or indirectly, the vertex 0.” 


The first proof is a direct computation of Pr(A). 


Proof 1: We enumerate all graphs from each of whose vertices 1,..., all choose, 
directly or indirectly, the vertex 0. Let m, be the number of immediate predeces- 
sors of the vertex i. Obviously m, = 1 and L7_)m, =n. 

Construction: 


a) Given {m,}, choose successively for each vertex i = 1,...,n its m,; immedi- 
n — 
m 


ate predecessors. For i = 1 there are _ possibilities; F(1) = 1 is not 


allowed since this would create a cycle. For i = 2 there are two cases: 
Gi) If F(2) =1, then F(1) = 2 is not allowed and there are (" es: 7 ; 


2 
possibilities. 


(ii) If F(2) # 1, then F(2) = 2 is not allowed and there are also ‘ 7 i, 7 ‘ 


2 
possibilities. 
HM, ** ~My_ 3 ; 


This argument holds for all k so in general there are ( ie 
k 


possibilities to choose the m, immediate predecessors of vertex k. 
b) The m, vertices that are left are the immediate predecessors of the vertex 0. 


In all there are 


_ n—-1 
~ (my —1,m,,...,m, 


possibilities to construct such a graph for given my,..., m,. 
Summing over all m, we get 


n—1 _ Fe 
Pr( A) 7 > or Lehi ating se 2 n 


Mo=1, 
Lum,=n 


n—-1 
=Po( Pot: +P.) =Do- a 


Proof 2 is inductive and works by lumping together an individual n and the one 
it chooses. 


Proof 2: Suppose the proposition is proved for n — 1 > 2 individuals; for n = 2 
(exactly one voter) it is trivial. We partition A according to the choice of individ- 
ual n. 
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If F(n) = 0, we combine the individuals 0 and n into one new individual and 
start again with n — 1 individuals. By the induction hypothesis the contribution of 
this case to Pr(A) is po( po + p,): 


If F(n) =), 7 € (1,...,n — 1}, we combine individuals j and n and obtain 
probability weights po, P),-.-; DP; + Pys-+++>Pn—1> hence this case contributes the 
probability p; po. 


Finally, if F(n) =n, we have a cycle, so this case contributes nothing. 
Summing over all these possibilities we get: 


Pr( A) = po( Po + Pa) + 


n—-1 
3 Pj 
j=l 


po= pal = Do. fl 
j=0 


The next proof is also by induction, but in this case we show that lumping 
together individuals n — 1 and n does not change Pr(A). Iterating this argument 
then proves the proposition. 


Proof 3: We imagine that only the individuals 1,...,” — 2 have already chosen 
their successors. 
We consider two scenarios: 


(i) with individuals n — 1 and n left to make their choice, 

(ii) with the composite individual (m — 1)’, which is obtained by lumping 
together individuals n — 1 and n. This individual has weight p,,_4) = 
Pn-1 a Pn: 


Obviously the probability that there is no cycle so far is the same in both 
scenarios. | 

Let 7, be the set of all individuals who have, directly or indirectly, chosen k. At 
this moment, in (i), 7), 7,_,, and 7,, are disjoint and are trees since I’ so far 
contains no cycles. In (ii) the same holds for 7) and T,,_ 1. 

In scenario (i) a cycle in I would be created if F(n — 1) € T,_, or F(n) € T, 
or {F(n — 1) € T, and F(n) € T,,_,}. The probability for this is §,_, + S,, where 
S,—, and S, designate the sums of the weights of all vertices (including the root) in 
T,_, and 7), respectively. 

In scenario (ii) the choice of (nm — 1)’ produces a cycle if and only if F(n — 1)) 
€ [(,-1. The probability of this is S(,-3) = 8, + $,-1- as 


In Proof 4 we consider the stochastic. process of the successive choices of the 
individuals. 


Proof 4: Let the n individuals make their choices in succession beginning with 
individual n and ending with individual 1. Let C, be the event that individual k 
completes the first cycle. We claim: 


Pr(C,) = 


in 
= 
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If this claim holds, the proposition follows by an easy calculation: 


Pr(A) - P| A cf = 7 1- 


n—-l n—2 
pe Dap. 
eee | eee Po ; 
= n n—-l BY 0° 
yp; y p, Pot Pi 
i=0 i=0 


We check the correctness of the claim. Suppose that the individuals {n,...,k + 1} 
have not completed a cycle: this is the case if and only if each of them has chosen 
(perhaps indirectly) one of {0,..., k}, otherwise a directed path along the arrows 
would start from this individual without reaching any element of {0,..., k}, and 
would therefore be a cycle. For example, in Figure 1, n chooses (n — 2),(n — 1) 
chooses k, and (n — 2) chooses k (indirectly). 


Figure 1 


The event that individual k now completes the first cycle consists of two parts: 


Gi) F(k) =k, which bates with probability p,. 

Gi) F(k) € {k + 1,...,n}, and F(k) has chosen (perhaps anearestly) k; the first 
of these occurrences happens with probability U?_,,, p;. As to the second 
occurrence, consider the path starting at F(k). The probability that it hits 
the set {0,...,k} in k is p,/X*_y p;. Therefore: 


= Pr Px 
Pr(C,) =p,p + DD; =: a 


k 
a LP; » DP; 
j=0 i=0 


Proof 5 gives a bijection between cycle-free graphs of F and graphs where 
F(1) = 0. It also provides an economical algorithm for generating a random 
spanning tree with root 0. 


Proof 5: Consider the set B of all graphs where 1 — 0 is an edge. B occurs with 


probability py. We want to prove Pr(A) = Pr(B). We do this by constructing a 
bijective map f: B — A with the property Pr( f(b)) = Pr(b) for all b € B. 
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Consider a graph b € B. For example: 


The graph defines a partition into cycles of all the vertices that lie on a cycle. In 
our example: (753)(6). 
Now we define f(b) € A by assigning the following graph to the corresponding 


permutation (753)(6) = ( = ? : : 


Both graphs occur with the same probability, namely pj p,p3P3PsPeP7, be- 
cause we do not change the number of edges that point toward each vertex. 

The example describes how to construct a measure-preserving bijection (each 
permutation has a unique cycle-representation) for each selection of vertices that 
builds the cycles or the path between 1 and 0. Together the bijections define the 
desired f. | 


Proof 6 uses the fact that a cycle-free F codes a tree with root 0, and also gives 
a simple algorithm to generate a random spanning tree in a special case. 


Proof 6: The connected graph without cycles generated by F (when A occurs) can 
also be considered as a labeled rooted tree. In this context it is more natural to 
think of F(i) as the predecessor rather than the successor of i. Each vertex i has 
exactly one predecessor F(i). The root is zero. 

According to Knuth [5, p. 389] we can code each rooted {0,..., m}-labeled tree 
f by avector K(f) = (K,(f),..., K,(f)) € {0,..., n}", see Fig. 2. 

The following proof is based on the equation 


Pr(A)= ) Pr(F=K™'(x)) 


and the simple structure of the summation range. Three properties of K are 
important for the computation of Pr(F = K~'(x)): 


(i) K,(f) is the label of the root of f. 
Gi) The number of children of a node equals the number of occurrences of its 
label in the multiset [K,(f),..., K,(f)I. 
(iii) Every x € {0,..., }” codes a tree. 
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fM f ©) f() fA) f (8) fQ) fQ) f@G) 
1 1 3 8 0 3 3 0 


Figure 2. Knuth’s coding of a rooted {0,..., n}-labeled tree f into K(f) € {0,..., n}”": We get the list 
of numbers K,(f),..., K,(f) by iteratively removing the highest labeled leaf and noting the label of its 
predecessor. In each iteration step, a node is considered to be leaf, if all its children are already 
removed. 


From (i) follows that Pr(F = K~'(x)) = 0 for K,(f) # 0, because if F is a tree 
at all, it has 0 as root. For every rooted labeled tree f, (ii) implies equality of the 
multisets [f(1),...,f(m)] and [K,(f),...,K,(f)]. Let T be the set of all 
{0,..., n}—labeled trees with root 0. We obtain: 


Le Pr(F =f) = ) Pray Paw = Lo Pry Px 
feT feT feT 


= LPs, Px, = (Pot +a)” “Po = Po- 
x€{0,...,n}"~-!x {0} 

Note that this also gives an algorithm for generating a random spanning tree: 
Just take K~'(X,,...,X,) with X,,...,X, independent and identically dis- 
tributed according to the weights of the random mapping F. 

The tree can be recovered from the sequence by successively assigning to the 
Kj f)’s their successors Ki(f). Remembering the rule of construction of the 
K,(f)’s, one gets the arrangement illustrated in Figure 3. 


7 
Ki = 6 ye 
1 


6 7 
=e aN \Z 
: 1 


Figure 3 
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Formally 
K} = max{{1,...,n} \ (Kj,..., Ki_,,K;,,...,K,}}, lsisn, 
in our example: K,, K4, K4,..., Kg = 7, 6,5, 4, 8, 2, 1, 3. a] 


ACKNOWLEDGMENTS. We thank Anton Wakolbinger and Brooks Ferebee for comments that have 
greatfully improved the presentation. 
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NOTES 


Edited by Jimmie D. Lawson and William Adkins 


Lexell’s Theorem Via an Inscribed 
Angle Theorem 


Hiroshi Maehara 


We present a simple inscribed angle theorem in spherical geometry, and apply it to 
give a short proof of Lexell’s theorem. 


Theorem 1. For any spherical triangle ABC inscribed in a fixed circular arc T with 
end-points A, B, the value of 7C —(ZA+ ZB) is constant. 


Figure 1 


Proof: Let O be the center of the spherical cap determined by I. Then, since 
the base angles of a spherical isosceles triangle are equal, it follows easily 
from Figure 1 that 

ZO =(LA+ 2B) = £22 0AB, 
where the sign is + if [ is a minor arc, and — otherwise. a) 


Let |ABC| denote the area of a spherical triangle ABC on the unit sphere. 
Then by Girard’s formula, we have |ABC| = 2A+2Z2B+2ZC- 7. 


Theorem 2 (Lexell). Let ABC be a spherical triangle on the unit sphere, and let # be 
the hemisphere bounded by the great circle AB and containing C. Then the locus of the 
point X © # satisfying |ABX| = |ABC| is the circular arc A*CB*, where A*, B* are 
the antipodal points of A, B, respectively. 


Proof: It suffices to show that |ABX| = |ABC| for any point X on the circular arc 


A*CB* (X # A*, X # B*), By theorem 1, we have 2 A*CB* — (2 CA*B* + 
Z.CB*A*) = Z A*XB* — (2 XA*B* + ZXB*A*). Since ZA*XB* = 2 AXB, 
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ZL XA*B* = 7 — ZXAB, 2 XB*A* = 7 — ZL XBA, we have 2 AXB + Z BAX + 
ZL ABX = 2 ACB + Z BAC + Z ABC. Hence, by Girard’s formula, we have 
|ABX| = |ABC|. a 


For a different proof of Lexell’s theorem, see L. Fejes Toth, Lagerungen in der 
Ebene auf der Kugel und im Raum, Springer-Verlag, Berlin, 1972, p. 23. 


College of Education, Ryukyu University, Nishihara, Okinawa, 903-0213 Japan. 
hmaehara@edu.u-ryukyu.ac.jp 


A Characteristic Property 
of Differentiation 


Khristo Boyadzhiev 


We offer here a simple exercise in calculus with a flavor of functional analysis. The 
differentiation operator D: f — f' is a fundamental operator in calculus and it is 
interesting to consider what properties distinguish it from all other operators on 
functions. One important theorem says that if a differentiable function f(x) has a 
relative minimum (or maximum) at x = a, then f’(a) = 0. As we shall see now, 
this property “almost” characterizes D. 


Notation. For convenience we consider only polynomials. Let P be the set of all 
polynomials and let p,, n = 0,1,..., be the basic polynomials: 

Po(x) = 1, pix) =x,...,p,(%) =xX",.... 
When 6: P > P isa linear operator, we denote its action on p € P by 6[ p]. Thus 
5[ p] is again a polynomial and its value at some number x is written as 6[ p\(x). 


Theorem 1. Let 5: P — P be a linear operator. Then the following are equivalent: 


(i) If p has a relative minimum at x = a, then 8| pa) = 0. 
(ii) 6 = d[p,]D. 


In particular, if 6[p,] = po, then 5=D. (Here “minimum” can be replaced by 
maximum.” 


Proof: The implication Gi) — (i) is immediate, so we focus on (i) > (ii). First we 
want to show that every linear operator on P has a convenient general form. By 
Taylor’s formula, for any polynomial p and any number a: 

= P(a) 


p(x) = E(x ah 


The sum is finite and we write “oo” just for convenience. Applying 6 to both sides 
(as polynomials of x, with a fixed) we obtain 


oo (k) a 
s[p]= X= scx @)*] (1) 
k=0 
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where 8[(x — a)*] are polynomials of x. Setting 


1 k 
g(a) = 7 8|(x - 4) "](a), (2) 


1.e., evaluating these polynomials at x = a, we define new polynomials g,, k = 
Oi Berea | 

It is clear that (2) defines functions of the variable a, but why are these 
functions polynomials? Good question! To answer it we use the binomial formula 
for (x — a)* and the linearity of 6. Now (1) can be written in the form 


S[ pl(2) = b g(a)p(4) 
k=0 

or simply 

S[p]=go + ip + 8.p t+. (3) 
This is the representation we need: the action of 6 is expressed in a simple way in 
terms of the polynomials g,. Notice that we did not use here property (i). 
Therefore, the general representation (3) is true for every linear operator on the 
polynomials. 


It turns out that under condition (i) we have g, = 0 for every k > 1. Indeed, 
consider the polynomial 


1 2 A k 
f(x) = 5(x- 4)? + F(x- a) 


which is specially designed to serve our purpose. Here a and A are arbitrary real 
numbers and the integer k > 2. We have f’(a) = 0, f”(a) = 1, so f has a relative 
minimum at x = a. According to property (i) 

8[ f](4) = g2(4) + Ag, (a) = 0 
Since this is true for all A and a, we conclude that g, = 0 identically for all k > 1. 


Also, go = 5[ po] = 0, as py has minimum at each number. Finally, using the 
linearity of 6 we obtain 


g(a) = d[(x - a)](a) = 8[ p, — apo](a) 
= 6[p,](2) — 48[ po ](4) = 5[p,](4). 
Therefore, the representation (3) turns into 


6[p] = d[ pi ]p' 
for every polynomial p. | c 


The same proof gives the following. 


Theorem 2. Let 6: P — P be a linear operator with the following property (Minimum 
Principle): 5| pa) = 0 whenever a polynomial p has a relative minimum at some 
number x = a. Then 5[ p| = g,p' + gp" for all p € P, where the polynomials g,, g» 
are defined in (2) and g, = 0. 


Theorem 2 naturally extends to polynomials of many variables: any linear 
operator 6: P — P satisfying the Minimum Principle is a second-order elliptic 
partial differential operator. For instance, Markov processes (semigroups) in 
diffusion theory have generators that satisfy the Minimum Principle [1]. There- 
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fore, we conclude that diffusion in nature is governed by second-order elliptic 
partial differential operators. Theorem 2 (in a different form) originates from 
A. Kolmogorov. Some others contributed to it, providing modifications and exten- 
sions: comments and references can be found in [1, Chapter 5] and [2, Chapter 
XIII]. 
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A Weighted Mixed-Mean Inequality 


Kiran S. Kedlaya 


In [4], the author established the following inequality conjectured by Holland [3]. 
Unbeknownst to either of these parties, the same inequality had been earlier 
announced by Nanjundiah [8] without proof. 


Theorem 1. Let x,,x,,...,x, be positive real numbers. The arithmetic mean of the 


numbers 
n 
X15 V*1%2 pe 00g V%1%2 eee Xn 


does not exceed the geometric mean of the numbers 
X, +X, Rh Td 
Br es 


Equality holds if and only if x, =x, = ++ =x 


X15 
n° 


Here we prove the following weighted extension of Theorem 1. 


Theorem 2. Let x,,...,%X,,W1,-..,W, be positive real numbers, and define s,; = w, 
+++ +w, fori =1,...,n. Assume that 
Wi W> w 
—>— > 3. (1) 
Sy So Sn 


Then the weighted arithmetic mean of the numbers 


X15 YI S20 2/82, Bens PI Sayer $n eee xWalSn 


does not exceed the weighted geometric mean of the numbers 


Wi W2 Wi W2 Wr 
Nig ey a Nee A Tk a eee 
2 So Sn Sn n 
when each mean is taken with weights W,/S,,,W>/Sp>+++>W,/S,- In other words, 
n iw, Wi/Sn nw, J 
J J w;/S; 2 
Tj Lx) = LTT. (2) 
i=1 \ j=1 Sj j=l °n i=l 
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Equality holds if and only if x, = ++ =X,,. 
The constraint (1) might not be the weakest possible, but some constraint is 
definitely necessary; for example, one needs to have 


Wi Wa Wn-1 Sn-1 
— — eee Fearne anos > es 
Sy So Sn-1 Sn 


or else (2) fails for x, >> x,_, >> **: >> x,. Preliminary calculations suggest that 
this condition might even be sufficient, but a proof seems difficult. Theorem 2 is 
asserted without any condition on the weights in [1, pp. 122-123]; of course the 
proof given there is incorrect. 

The ingredients of the proof of Theorem 2 are the same as in [4], except that we 
use induction to simplify the computations; one may unravel the induction to 
obtain a proof that, in the case of equal weights, coincides with the proof in [4]. A 
different inductive proof of Theorem 1, using Lagrange multipliers, appears 
in [5]. 


Proof: We prove Theorem 2 by proving an analogue of Rado’s inequality [2, 
Theorem 60] in this setting. Namely, if L, and R,, denote the left and right sides 


of (2), we prove that 
L. Sp ’ ae Sn-1 : 
—| > 
ee a (3) 


for n > 1. We note in passing that a similar argument gives an analogue of 
Popoviciu’s inequality [9]: 


S,(L, =. R,,) = Sy—1( Las ~ R,,-1). 
Unraveling (3), we see that it is equivalent to 


n Ww: Wh n-1 Sn-1 n Ww. j=1 S,, 
a es tized ee BE) od 
i=1 


j= Sn j=1 Sn-1 i= j=1 Sn 


We prove this nee in two steps. First, we observe that 
n-1 
j=l 

since W,S, = W,S; by (1), we may apply the weighted arithmetic-mean, geometric- 

mean inequality to each summand on the right side and obtain 


WS, — W,S; 
K) 


L Tans = > “1 pens 4 Nees 1%n The 7 


Sy- 1 i= j=l n—-15n i=1 Sn 15, i= 


WjSp —wWw n5j w;s 


n—-1 J nw, = far —— 
[ [x75 > 3 —x; j5n-1 ieee nd (5) 
j=l Sy—1 i=1 j=l °n i=1 


Second, we apply Holder’s inequality to get 


i W; ae ae — be W; Wal Ss Le WwW; 
pire 7 [TT x; » Pe = 25 rail . (6) 
ak , 


a 


Together, (5) and (6) imply (4), and (2) now follows by induction on n (since 
equality vacuously holds for n = 1). The equality condition also follows by induc- 
tion: if equality holds in (2), then equality in (3) forces x, = «+ =x,_, by 
hypothesis, and equality in (6) forces x, = x,. 
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We mention here three ways in which Theorem 2 can be extended easily. First, 
one can replace the arithmetic and geometric means by the r-th and s-th power 
means, respectively, for any r > s; the corresponding analogue of Theorem 1 is 
formulated in [6]. Recall that for r # 0, the r-th power mean of x,,...,x, with 
weights w,,...,W, 1S given by 


Extending by continuity to r = 0 yields the weighted geometric mean. The state- 
ment of the inequality then becomes 


i s/r\ 1/s — es ex ll? 
be | Say | 2(£2(2 i | , 


i=1 Sn 


the Rado and Popoviciu-type inequalities become 

§(L, Ry) Ss,.4(Lia Ray hao; 
and the proofs carry over upon replacing the weighted arithmetic-mean, 
geometric-mean inequality by the weighted power mean inequality, and Holder’s 
inequality by Minkowski’s inequality [2, Theorem 24]. This last inequality appears 
to hold for all k € [s,r], but I do not have a proof. 

Second, one can prove a analogue of Theorem 2 for Hermitian matrices, using 
the arithmetic and harmonic means, following Mond and Peéari¢ [7], who proved 
such an analogue of Theorem 1 using a matricial Minkowski inequality. 

Third, one may use a straightforward limiting argument to deduce the following 
continuous analogue of Theorem 2. We leave the formulation of the corresponding 
power mean generalization to the reader. 


Theorem 3. Let f(x) and w(x) be continuous positive-valued functions on [0, 1], and 
let W(x) = {jw(t) dt. Assume that w(x)/W(x) is aaa on (0, 1]. Then 


1 [{w(y) py w(x) | 1w(y) | ; 

exp f(x) d&| dy= log f(x) dx | dy. 
| om ef, yf way fa a 

Finally, we use Theorem 2 to generalize a well-known inequality of Carleman: 
for a sequence {a,}"_, of positive real numbers with La, < ©, 


D (a, a)" <e YS ay. 

k=1 k=1 
It was observed in [3] and in [8] that this inequality follows from Theorem 1. We 
refine this observation slightly to obtain a weighted version of Carleman’s inequal- 
ity. Surprisingly (to the author, at least), the constant on the right side does not 
depend on the weights! 


Theorem 4. Let w,,w,,... be a sequence of positive real numbers, and define 
S;=W, ++ +w, fori =1,2.... Assume that 

Wea: «9 

a 

Sy 52 
Then for any sequence a,,a,,... of positive real numbers with L,.W,a, < ©, 


co 


M4 8 


way’ sk ae anes Sk <e > W, ay. 


k=1 k=1 
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Proof: Taking x, = a, for k = 1,...,n in Theorem 2, we obtain 


Of course Df_,w,a; < X%_,w,a;, and so 


nee © 


n 

Wi/Sk cee qQWe/ Sk Pree Se (RTE 
as We Oe s M17 Sn eee ghn/ Sn pe a 
k=1 1 n 


In addition, using partial summation and the bound logx < x — 1 for x > 0, we get 


Si, mn WE 
gin see ga78, AP me — (logs, — logs, ) 
1 n k=1 “n 
1 


n— S; 
= exp ) ~ (loss +1 — logs; ) 


k=1 “n 
n-1 
Sk+1 Sk Sy 
< exp © { - +) = exp{1 - <e. 
k=1 Sn Si, Si 


Thus we have the desired inequality except with n in place of ©, but taking n > © 
gives what we want. Ly 


Again, one can easily state and prove power mean and continuous analogues, 


and again the conditions on the weights are probably not the weakest possible. 
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UNSOLVED PROBLEMS 
Edited by Richard Nowakowski 


In this department the MONTHLY presents easily stated unsolved problems dealing with 
notions ordinarily encountered in undergraduate mathematics. Each problem should be 
accompanied by relevant references (if any are known to the author) and by a brief 


description of known partial or related results. Typescripts should be sent to Richard 
Nowakowski, Department of Mathematics & Statistics & Computing Science, Dalhousie 
University, Halifax NS, Canada B3H 3J5, rin@mscs.dal.ca 


Periods in Taking and Splitting Games 


Ian Caines, Carrie Gates, Richard K. Guy, 
and Richard J. Nowakowski 


In 1997, J. H. Conway asked who wins Couples-Are-Forever? This game is played by 
two players playing alternately with heaps of counters. A move is to choose a heap 
and split it into two non-empty heaps, except that no heap of 2 may be split, hence 
the name. The player who moves last wins. 

Without the exception, the game is easy to analyze. The next player wins if and 
only if there is an even number of heaps that contain an odd number of counters. 
This is the archetypal she-loves-me-she-loves-me-not game found in [2, pp. 113, 115]. 

Recall that the nim-value of a position G, denoted by @(G), is defined 
inductively. The end-positions, positions from which there are no moves, have 
value zero. Any other position has value the minimum excluded non-negative 
integer (mex for short) that does not occur among the values of its followers, i.e., 
those positions that can be reached in one move. If the nim-value of a position is 
zero then the next player cannot win; if it is non-zero then the next player does 
have a winning strategy. 

Couples-are-Forever is a disjunctive sum of games, that is, a player must choose 
one heap and make a move in it. The nim-value of a disjunctive sum of games is 
the nim-sum of the nim-values. This is obtained by writing out the nim-values in 
binary and adding without carrying, denoted by ©. The single heap games of 
Couples-are-Forever have nim-values: #(1) = #(2) =0 since these are end- 
positions. A heap of 3 can be split into heaps of size 1 and 2 so that (3) = 
mex{¥(1) © (2)} = mex {0 © 0} = mex {0} = 1. Similarly, 


G(4) = mex{F(1) © F(3), (2) + F(2)} 
= mex {0 © 1,0 + 0} = mex {1,0} = 2. 


The first twenty-five nim-values are: 0, 0, 1, 2, 0, 1, 2, 3, 1, 2, 3, 4, 0, 3, 4, 2, 1, 3, 2, 1, 0, 
2, 1,4,5. By automating the work, we have found the nim-values for heaps of size 1 
through 50 million and no pattern has emerged. The value 0 occurs fourteen times 
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and 1 twenty-six times. The largest nim-value found is #(19, 739,544) = 325 and 
the most common values are: 


nim-value: 256 257 129 128 134 135 
no. of occurrences: 1,798,261 1,596,606 1,059,556 1,058,974 894,436 893,216 


The behavior of the nim-values is quite striking when they are plotted against the 
heap size. The upper envelope of the graph rises in a linear fashion to approxi- 
mately 300 at heap size 20,000. Then it is almost constant for the rest of the 50 
million terms. A ‘sparse-space-common-coset’ phenomenon occurs; see [5]. A rare 
nim-value in this game is one which has an even number of 1-digits in its binary 
expansion if the digits in the ‘ones’ and ‘sixteens’ places are ignored. Values with 
an odd number of such 1-digits are called common. The nim-sum of two rare 
values, and likewise of two common values, is a rare value and so rare values tend 
to get excluded in the mex operation, keeping them rare. This allows for fast 
calculation of the nim-values: find the smallest common value that does not arise 
as the nim-sum from heaps, one with a rare value, the other with a common. Then 
the mex is either this, or is a smaller rare value, so compute sufficient ‘common- 
common’ (and ‘rare-rare’) values until all smaller rare values are excluded, or, 
exceptionally, you’ve discovered a new rare value. 

If the rare values stop appearing and the sequence of nim-values remains 
bounded, then it eventually becomes periodic. For Couples-are-Forever, the last 
rare value found is ¥(20,628) = 277. Unfortunately we cannot prove that there are 
no more. 


Question: Is Couples-are-Forever ultimately periodic? 


If it is ultimately periodic then the period length cannot divide the size of any 
heap that has nim-value zero. The nim-value zero occurs for heap sizes 1, 2, 5, 13, 
21, 31, 47, 73, 99, 125, 151, 177, 315, and 409, so the period cannot be any of: 1, 2, 
3,5, 7, 9, 11, 15, 21, 25, 31, 33, 35, 45, 47, 59, 63, 73, 99, 105, 125, 151, 177, 315, or 
409. 

Are there any other reasons to believe that Couples-are-Forever is periodic? 
Some variants are easily shown to be periodic. If heaps of size 1 through n are not 
allowed to be split then: the period is 0,1,2..., with a pre-period of n — 1 zeros 
if n is odd; the period is n + 1,1,2,...,n with a pre-period of n zeros followed 
by 1,2,...,7,1,2,...,.2+1,1,2,...,.n+1,1,2,...,n—1,0 if n=2* —4 for 
k = 3. However, the other values of n give games as intransigent as the original; 
see [3] for other results. 

Couples-are-Forever is very similar to Grundy’s Game (split a heap into two 
unequal heaps) in that the game ends with heaps of just one or two. This also 
defies analysis, in spite of several energetic attempts. It was during his computa- 
tion of the first quarter million nim-values that Elwyn Berlekamp discovered the 
sparse-space-common-coset phenomenon (see [2, p. 111]); here the sparse space 
comprises vectors of even weight after deleting the units digit. The 1,272nd rare 
value is (36,184) = 157 and then, apart from #(82,860) = 108, no more. Nor did 
Mike Guy find any more among the first ten million. But Anil Gangolli found 
F(47,468,481) = 261, (48,142,376) = 265 and 11 other rare values between. Dan 
Hoey has reached 11 billion, but found neither a pattern nor any more rare values. 


Wild conjecture: All finite octal games are ultimately periodic. 
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An octal game is a ‘take-and-break’ game with Guy-Smith code [6] .d,d,d,..., 
where the base 8 digits d, = 2°h,, + 2'h,, + 2°hj with h,,; = 1 or 0, signify that 
you may(1) or may not(0) take i counters from a heap, provided that you leave 
exactly 7 nonempty heaps in its place. For example, in the game .172, you may take 
one counter if it’s on its own, two counters in any case, leaving the remainder, if 
any, in one or two heaps, or three counters from a heap provided that it is not the 
whole heap. The analysis of this game has not been completed, nor has it for 
Officers, .6 (take one counter from a larger heap, leaving the rest as one or two 
heaps), nor for .007 (Treblecross or one-dimensional tic-tac-toe; remove three 
contiguous skittles from a row), nor for any of the following games ((7, pp. 
475-476], where .644 should not appear, since its period length, 442, was discov- 
ered by Richard Austin [1]): .06, .14, .36, .64, .74, .76, .004, .005, .006, .016, .104, 
106, .114, .135, .136, .142, .143, .146, .162, .163, 324, .336, 342, .362, .371, 374, 
404, .414, .416, .444, .454, .564, .604, .606, .744, .764, .774, .776, and plenty of 
games where you're allowed to take more than three counters. 


We return to the splitting games. A pure period is a sequence of numbers 
(a,,a,...,a,) where 


a; = mex {a; ® a;-;li = 1,2,...,k —1, where the subscripts are taken cyclically}. 


When a pure period exists, the corresponding splitting game has #(n) prescribed 
for n = 1,2,...,k and thereafter, Y(n + k) = F(n). For example, 0 1 is a pure 
period. There are pure periods 2” 3? 2” 1 of length 2n + 3, and 1 3 0)" 311 of 
length 2n + 4, for every n > 1, where (*)” means * repeated n times. 

A pure period of length k can have no value greater than [k/2], since there are 
only [k/2] terms when taking the mex and one of the terms must be zero for the 
maximum nim-value to occur. We have calculated all pure periods up to length 11, 
of which there are many. 

For period lengths one through eleven, there are 1, 2, 1, 2, 3, 6, 7, 18, 13, 58, 50 
pure periods respectively. All have the form SS~'1 if of odd length and SaS~'1 if 
even, where S~' is the sequence S in reverse and a is a single number. 


What finite sequences of nim-values are pure periods’ Must they have the form 
SaS~'1 or SS~'1? 
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PROBLEMS 


10725. Proposed by Vasile Mihai, Toronto, ON, Canada. Fix a positive integer n. Given a 


permutation a of {1,2,...,n}, let f(@) = pe (@(i) —a (i+ 1)’, where a(n+1) = a@(1). 
Find the extreme values of f(a) as a ranges over all permutations of {1,2,..., n}. 


10726. Proposed by Donald E. Knuth, Stanford University, Stanford, CA. Start in state 0. 
For every nonnegative integer k, stay in state k for X, units of time, then go to state k + 1. 
What is the probability of being in state s after ¢ units of time, assuming that X; is distributed 
exponentially (a) with mean 1/(kK +1)?  (b) with mean 1/ 2k9 


10727. Proposed by Jean Anglesio, Garches, France. Let m be a fixed positive integer. For 
a positive integer n, let s,,(n) be the sum of the mth powers of the decimal digits of n. For 
example, 53(172) = 17+ 7° +2? = 352. Starting with any positive integer no, construct a 
sequence of positive integers by setting nz = S»(nz_1) for every k > 1. 

(a) Show that no, m1, 2, ... 1S eventually periodic. 

(b) Show that only finitely many periods are possible as no varies. 


— 10728. Proposed by Titu Andreescu, American Mathematics Competitions, Lincoln, NE. 
Determine all functions f: Z — Z satisfying 


fe+yt2) =(f@) + (FO) + (FO) 
for all integers x, y, and z. 


10729. Proposed by David P. Bellamy and Felix Lazebnik, University of Delaware, Newark, 
DE. Let I C Rbe an open interval, and let 1 be a positive integer. Characterize the functions 
f: I — R that have a continuous nth derivative and satisfy 


FO + pi fC +--+ pn-if’ + Pnf =0 


for some continuous functions p1, p2,..., Pn on I. 
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10730. Proposed by Walther Janous, Ursulinengymnasium, Innsbruck, Austria. Fix an 
integer n > 2. Determine the largest constant C(n) such that 


2 (xj — xi) = C(n)- anin (xi+1 = xi)” 


l<i<j<n 
for all real numbers x; < x2 <--+ < Xp. 


10731. Proposed by M. J. Pelling, London, England. Let A be an n-by-n real symmetric ma- 
trix, and consider the quadratic form Q(x) = x! Ax forx € R”. Let C be the cube [—1, 1]”. 
Prove that max,ec Q(x) is at least as large as the sum of the positive real eigenvalues of A. 


SOLUTIONS 


Connected Sets of Periodic Functions 


10434 [1995, 170]. Proposed by Daniel Goffinet, Saint Etienne, France. Let P be the set 
of nonconstant periodic mappings from R to R, endowed with the topology derived from 
the supremum norm. Find the components of P. 


Composite solution I by Kiran S. Kedlaya, Massachusetts Institute of Technology, Cam- 
bridge, MA, Kenneth Schilling, University of Michigan, Flint, MI, and Arlo W. Schurle, 
University of Guam, Mangilao, Guam. For any function f: R — R, define || {|| to be 
sup{ | f(x)|: x € R}, which is taken to be co when the set of values of f is unbounded. 

We first show that f and g are in different components of P if || f — g|| = oo. Let 
B, ={k € P: ||k — g|| < co}. By the triangle inequality B, is an open set, andifh ¢ By, 
then the triangle inequality again shows that {z: ||z —h|| < 1} By, = %. Consequently 
B, is both open and closed, and so the component of P containing any given g € P must 
lie in Bg. 7 

Conversely, if f — g is bounded for f, g € P, then there is an arc in P joining f 
to g. First, suppose that f and g have a common period p. The standard path k,(x) = 
(1 — t) f(x) + tg(x) for 0 < t < 1 consists of functions having p as a period, and since 
lf — gll is finite, k; depends continuously on t. There is a danger that some k;(x) is a 
constant function, but this can happen only if f is an affine function of g, that is, there are 
constants A and B with f = Ag + B. In this case, the function h(x) that is equal to f(x) 
except at integer multiples of p, where it is f(x) + 1, is at bounded distance from both f 
and g and is not an affine function of either. A path from f to g can be obtained by taking 
the standard path from f to h followed by the standard path from h to g. 

Suppose now that f and g have no common period. Let r be a period of f and let s be 
a period of g. We wish to construct h that has both r and s as periods such that || f — hl 
(and hence also ||g — All) is finite. To do this, pick an arbitrary set of coset representatives 
for R/(rZ+ sZ), define h to agree with f at these values, and extend by periodicity. Then 
for any x, let x = y +rm-+sn, where y represents the coset containing x. Then 


A(x) — fx) =|fO) — Ff + sn)| 
=(f(y) -—e(y) + e(y + sn) — f(y tsn)| < 2||f - ell 


Since f and h have common period r and || f — h|l is finite, there is a path from f to h, and 
since h and g have common period s and ||h — g|| is finite, there is a path from h to g. 


Composite solution II by Fredric D. Ancel, University of Wisconsin, Milwaukee, WI, Phil 
_ Bowers and John Bryant, The Florida State University, Tallahassee, FL, and the proposer. 
We assume that “mapping” means “continuous function”. Then two functions in P belong 
to the same component if and only if they have commensurate periods. As in solution I, the 
components are path-components. 
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Given f € P with period 27n, we form a path from f(x) to sinx via the homotopy 
h,y(x) = (1—-t) f(x)+t sin x for0 < t < 1. Since ||h, — h,|| < (if + 1) |s — t|, the map 
t +» h; is continuous from [0, 1] to the space of all functions from R to R with the topology 
derived from the supremum norm. Each h; has 2nz as a period. This gives the desired path 
in P if no h;(x) is constant, i.e., unless f(x) = Asinx + B with A < 0. For such f(x), 
the path k,(x) = (1 — t) f(x) —tsinx connects f(x) to —sinx = sin(x +7). The path 
n;(x) = sin(x + (1 — t)z) then connects — sin x to sinx in P. The continuity of t H n, 
follows from the mean value theorem. If f and g have commensurate periods, say both are 
a multiple of p, then there are continuous paths in P connecting f(x) to sin(27x/p) and 
g(x) to sin(27x/p), hence there is a path from f to g. 

For a fixed real number p, let C, denote the set of functions in P whose period is a rational 
multiple of p. We now show that C,, is an open subset of P. Choose f € Cy. Since f is not 
constant, there are real numbers x and y such that f(x) < f(y). Sete = ( f(y) -— f() ) /3. 
We show that g € P and || f — g|| < € implies g € C,. Ifnot, then there is g with || f — g|| < 
€ such that the period of g, say q, is an irrational multiple of p. Since f is continuous, there is 
6 such that f takes the interval (y — 6, y + 4) into the interval (f(y) — «, f(y) + €). Since 
f € C), f also takes the interval (y — mp — 6, y —mp +5) into (f(y) —«€, f(y) +6) 
for each integer m. Since || f — g|| < €, g takes each (y —mp —6, y—mp +5) into 
(f(y) — 2€, f(y) + 2€). Since p/q is irrational, the numbers mp + ng for integers m and 
n are dense in R, so there are integers m and n such thatmp+ngq € (y —x —6, y—x +4), 
which gives x + ng € (y—mp—6,y—mp+6). Thus, g(x +nq) < f(x)+e = 
f(y) —2€ < g(x +ngq), acontradiction. This completes the proof that C,, is an open subset 
of P. 

Since the sets C, partition P into connected open sets, each set Cp is acomponent of P. 


The Plane Covered by Disks 


10440 [1995, 273]. Proposed by Marius Cavachi, Constanta, Romania. Show that the 
Euclidean plane cannot be covered with circular disks having mutually disjoint interiors. 


Solution I by Sam Northshield, SUNY, Plattsburgh, NY. We show that R* (k > 2) cannot be 
covered by metric balls having mutually disjoint interiors. 

Note that every set of balls with disjoint interiors is countable, since each contains a 
different point with rational coordinates. Let {B,: n € N} be a set of closed metric balls 
in R* (k > 2) with mutually disjoint interiors. A point of intersection of two of the B, is 
called an intersection point. Since the intersection of two distinct B, has at most one point, . 
there are only countably many intersection points. Hence we may choose a straight line 
segment y with its endpoints in the interiors of two distinct balls and such that y avoids all 
intersection points (here is where we need k > 2). Let C be the set of points of y that are not 
in the interior of any B,. Then C is closed and nonempty. Furthermore, any neighborhood 
of point x € C must contain another point of C; otherwise x would be an intersection point. 
Hence C is perfect, and thus uncountable. Now, for any n, the segment y intersects 0B, in 
at most two points, so there is x € C not in any B,. It follows that |) B, 4 R*. 


Solution II by Simeon T: Stefanov, Sofia, Bulgaria. Suppose the contrary. As in Solution 
I, there are only countably many intersection points. Let L be a line that avoids these 
points, and consider the family J = {LN By: n € N}, acountable cover of L with disjoint 
closed bounded intervals. To see that no such cover is possible, construct a nested family 
of compact intervals A,, C L such that A, M B, = 9, but A, meets at least two intervals in 
F. Then (| A, is nonempty, but no point of this intersection belongs to any set in F. 


Editorial comment. Victor Klee noted that the proof that there are only countably many 
intersection points requires only that the B, be rotund (i.e., strictly convex). His solution 
followed Solution II, with the last part traced back to W. Sierpifiski, Un théoréme sur les 
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continus, 76hoku Mat. J. 13 (1918) 300-303. Klee also noted that circular disks are smooth 
(i.e., possess a continuously differentiable parameterization) as well as rotund. For more on 
smooth tilings, see V. Klee, E. Maluta, and C. Zanco, Tiling with smooth and rotund tiles, 
Fund. Math. 126 (1986) 269-290; V. Klee and C. Tricot, Locally countable plump tilings 
are flat, Math. Ann. 277 (1987) 315-325; and P. M. Gruber, How well can space be packed 
with smooth bodies? Measure theoretic results, J; London Math. Soc. (2) 52 (1995) 1-14. 
D. G. Larman, A note on the Besicovich dimension of the closest packing of sphere in 
R,,, Proc. Cambridge Philos. Soc. 62 (1966) 193-195 shows that, in the case of packing of 
circular disks in the plane, the uncovered set has Hausdorff dimension at least 1.03. 
Solved also by G. E. Bredon, P. Budney, J. D. Clemens, J. Cobb, R. Holzsager, A. A. Jagers (The Netherlands), V. Klee, J. H. 
Lindsey II, O. P. Lossers (The Netherlands), R. Martin (Germany), L. E. Mattics, M. Misiurewicz, I. Namioka, O. Nanyes, C. G. 


Petalas & T. P. Vidalis (Greece), C. Popescu (Belgium), A. W. Schurle, J. H. Shapiro & T. L. McCoy, A. A. Tarabay & R. Barbara 
(Lebanon), and the Anchorage Math Solutions Group. 


Random Perfect Matchings 


10587 [1997, 361]. Proposed by Joaquin Gomez Rey, Madrid, Spain. Let K2, be the 
complete graph on 2n vertices. Let P, be the probability that two random perfect matchings 
of K2, are disjoint. What is limp—oo Pn? 


Solution by José Heber Nieto, Universidad del Zulia, Maracaibo, Venezuela. The limit is 
e— 1/2 = 0.60653. The number of perfect matchings of K2, is M, = (2n)!/(2"n!). Given a 
perfect matching G of K2, and aset J of j edges of G, there are M,_ ; perfect matchings of 
K>, containing J. Therefore, the inclusion-exclusion principle yields i =0(- 1)/ (“) Mn-j; 
as the number of perfect matchings of K2, disjoint from G. Thus 


m= vi(’ ee, J 


Now limy-+oo P, can be computed by applying Lebesgue’s dominated convergence theo- © 
rem. Let X = {0, 1, 2,...}, and define ameasure uz on X by w({j}) = 1/j!. Let fx: X > R 
be defined by 

' (—1)/n!Mn_j; = 
— ee a | J —-————————___———— , 
n= eM aa 
Then limy-+00 fa(j) = (—1/2)/. Furthermore, | f,(j)| < 1, and the constant function 1 is 
integrable, since fy 1du = ))729 1/j! =e. Therefore, 


2 E12) 
lim Py = tim [ frdu = | lim fad = > = et, 
n->0©o X : X j! 


j=0 
Solved also by R. J. Chapman (U. K.), R. DiSario, J. Grossman, J. Labelle, D. Tenny, NCCU Problems Group, and the proposer. 


Characterizations of the Medial Triangle 


10588 [1997, 361]. Proposed by Marcin Mazur, The University of Chicago, Chicago, IL. 
Let A,;A2A3 be a triangle. For i = 1, 2,3, let B; be a point on side Aj4;Aj42, where 
subscripts are taken modulo 3. 

(a) Show that | A; Bj+1| + |B; Bi41| = |Ai Bi+2| + |B; Bi+2| for i = 1, 2, 3 if and only if B; 
is the midpoint of Aj41 Aj+2 fori = 1, 2, 3. 

(b) Show that |A; Bj+1| + |AiBi+2| = |B; Bj41| + |B; Bi+2| fori = 1, 2, 3 if and only if B; 
is the midpoint of Aj+1Ai+2 fori = 1, 2, 3. 
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Solution by the proposer. If B; is the midpoint of Aj+1Aj+2 for i = 1, 2, 3, then triangles 
A,A2A3 and By, B2Bz3 are similar, so |B, B2| = (1/2)|A1 A], |B2B3| = (1/2)|A2A3], and 
|B3 By| = (1/2)|A3Ai1|. Hence 


1 1 
|A; Bg| + |B, B2| = Bs |A,A3| + a |A,A2| = |B, B3| + |A;B3l, 


and similarly for the other conditions of both parts. 

(a) We prove that for any triangle B; B B3 there exists exactly one triangle A; A2A3 such that 
|A; Bj44|+|B; Bizi| = |Aj Bi42|+|B; Bj+2| fori = 1, 2,3. This implies our assertion. Fix 
a triangle B, Bz B3, and suppose that for a triangle A; A2A3 the conditions are satisfied. Let 
(i, j,k) bea permutation of (1, 2, 3). Consider the hyperbola with foci B; and By passing 
through B;. Since-|A;B;| + |B; B;| = |A;By| + |B; Bg|, the hyperbola passes through Aj. 
Write h; for the part of the branch of the hyperbola passing through A; that is on the opposite 
side of the line B; By, from B;. Since B; and By are the foci of the hyperbola, h; is entirely 
contained in the union ofall lines joining A; and some point on the segment B; Bx. 

Now suppose that A is any point on hy, different from A,. (This A is a candidate for 
the vertex A, in a new triangle satisfying the conditions.) If A is inside triangle B2 A, B3, 
then the line from A through B> intersects h3 in a point P that is on the opposite side of 
the line A7A3 from Aj, and if A is outside of Bz A, B3 then P is on the same side of A A3 
as Aj. (Point P is the candidate for point A3 of the new triangle.) The same holds for the 
intersection Q of the line A B3 with h2 (the candidate for A> of the new triangle). Therefore, 
the line segment P Q does not pass through B;. We conclude that A cannot be a vertex of a 
triangle that satisfies our requirements. A similar argument shows that no point A outside 
triangle Bz A, B3 can be a vertex of a triangle that satisfies our requirements. Thus A; A2A3 
is the only triangle for which the conditions hold. 

(b) Let a, = (1/2)|A; Aj|, by = |B; B;|, and x; = a; — |A;B;|, where (i, 7, k) is an even 
permutation of (1, 2, 3). By hypothesis, a; + xj + a, — x, = bj + by. Adding two of these 
equations and subtracting the third yields bj = a; — x; + xx, So 


b? = a? + a; + i — 2ajxj — 2xjX~ + 2ajXK (1). 


By the law of cosines we obtain b? = (a; + xj)” + (ap —xz)* — 2(a; + Xj)(a~ — x) COS Aj. 
Since cos Aj = as + a? — a; /2aja, we get after simple transformations 


Dans, 2D Be 5D he Deh Ne ee 2 DN ny PPD ie Dn ON ae OD 1D 
by = yp + XP +E + at +0; —a,) = Pa + a; onan a ra + a;—a;) (2) 
Let z; = x; /a;. Comparing expressions (1) and (2) for b?, we get 

Zj(aj + 4; — ax) — ZR (ae +; — aj) + 2j2K(aj + ae — aj) = 0. 


If one of the z;’s is 0, then all of them vanish. If they are all nonzero, then dividing by z;z, 
and adding all three equalities we get aj + a2 + a3 = 0, which is evidently false. Therefore, 
all the x;’s vanish and the.assertion is proved. 


Solved also by M. Vowe (Switzerland) and GCHQ Problems Group (U. K.). 


Binary Expansions and kth Powers 


10596 [1997, 456]. Proposed by Paul Bateman, University of Illinois, Urbana, IL, and 
David Bradley, Simon Fraser University, Burnaby, BC, Canada. 
(a) Prove the identity 


2k-1_]} 
YD OGY + pt = et 24-DEDAGy + gM! — 1/2), 
j=0 
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where 7(j) is the number of ones in the binary expansion of the nonnegative integer j. 
(b) Use part (a) to infer that there is a positive integer s = s(k) such that every integer n 
is expressible in the form n = Ext + €nxk + +--+ Po in infinitely many ways, where 


€; = +1 for 1 <i < s and where x1, x2,..., Xs are distinct positive integers. 


Solution I to part (a) by David Callan, University of Wisconsin, Madison, WI. Withn = k—1, 
equating coefficients of y reduces the proposed identity to 


qn_| O if r<n, 
Y -pr7" = | nave if r =n; 
7=0 (n+ 1)!2"@-D/2(9" — 4) /2 if r=n+1. 


Let S(j) denote the set of positions containing 1 in the binary representation of j, so that, 
for example, 5(13) = $((1101)2) = {1, 3, 4}). Write 7 <x k when S(j) © S(k). Consider 
a set of boxes labeled 1,...,2” — 1. For0 <i <n —1, let G; be the set of boxes with 
labels 2',..., 2+! — 1. Note that |G;| = 2! fori > 0. 

Let f(j) be the number of placements of r distinguishable balls into ), s(j) Gi. Clearly 
f(j) = j”. Let g(j) be the number of such placements in which, for each i € S(j), at 
least one ball is in at least one box in G;. By the Inclusion-Exclusion Principle, g(k) = 
Yj e(—1)"™-1) f (7). In particular, g(2" — 1) = 2") (- yr 7”. 

‘Since S (2"” — 1) = {1,...,n}, the distributions counted by g(2” — 1) are those with all 
n groups nonempty. When r <n, there are none. When r = 27, one of the 2' boxes in G; 
is used, for each i. When r = n + 1, we distribute 1 balls and then one more, dividing by 
2 to eliminate overcounting. Thus both sides of the identity equal g(2” — 1). 


Solution IT by Richard Stong, Rice University, Houston, TX. 

(a) Letting A, f(y) = f(y +r) —f(), the left side of the identity is Ay Az Aq --- Ax-2 y*. 
If f is a polynomial of degree n with leading coefficient c, then A; f is a polynomial of 
degree n — 1 with leading coefficient crn. Since we have applied k — 1 such operators, the left 
side of the identity is a polynomial of degree 1 with leading coefficient k!2°*!+"+@—-) — 
ki1g&—-Dk—-2)/2, 

Since n(2*-! —1—i) = k—1-—n(i), the terms for 7 =i andj = 2‘-!_]—j inthe sum 
exactly cancel if y = —(2*—! — 1)/2. Thus the left side of the identity is the polynomial of 
degree 1 with leading coefficient k12%—-D(—-2)/2 that vanishes at y = —(2*—! — 1)/2. This 
agrees with the right side. 

(b) If k = 1, then s = 3 suffices, because the identities n = (n +2-+m)—(1+m)—1 and 
n=(m+2+n)—(m+1-—n) — (1 —7n) give suitable representations for all m > 1 in 
the cases n > O andn < QO, respectively. 

Now consider k > 2. Let M = k!2®-D@—2/2, and let s(k) = M + 2*-!. Given 
any integer n, choose integers g andr such thatn = (2k-! — 1)M/2 + Mq +,7, where 
O<r<M. Letx,..., x, be multiples of M, and let x-+41, ..., x, be numbers congruent 
to 1 modulo M. Nown+ Ye en is congruent to (2*-! — 1)M/2 modulo M. Thus for 


some y we have n + ye , ‘i = M(y+ (Qh 7) /2). The identity in (a) now yields 


gk-1_y M 


i= > Ge ot =), 


j=0 i=l 


The numbers x1, ..., x;g were chosen arbitrarily subject to congruence conditions mod- 
ulo M; there are infinitely many such choices. By fixing x,,...,xjy—, and making xy 
sufficiently large, we can ensure that y exceeds xy, since k > 2. Thus we have infinitely 
many choices in which xj,...,xy,y,y+1l,...,y+ 2k-! _ 1 are distinct, as desired. 


Solved also by K. MclInturff, J. H. Lindsey II, GCHQ Problems Group (U. K.), R. J. Chapman (U. K.), and the proposers. Part (a) 
solved also by D. Beckwith. 
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A Supremum of Sine Differences 


10604 (1997, 567]. Proposed by Joseph Rosenblatt, University of Illinois, Urbana, IL. 
(a) Determine positive constants c and C such that if0 <a < b then 


¢(1- 2) < <C(1-<). (+) 


(b)* What are the largest constant c and smallest constant C such that (*) holds whenever 
O<a <b? 

Solution of part (a) by Thomas Hermann, SDRC, Milford, OH. We may take c = 2/m and 
C = 4. To see this, let p(x) = sinx/x, 4 = a/b, and w(A, x) = (p(x) — g(Ax))/(1 — A). 
The problem is to find positive lower and upper bounds for M(A) = sup,.9 |w(A, x)|. Since 
lp(x)| < 1 for all x, 


sin(ax) — sin(bx) 
up | —- ———— 


x>0 ax bx 


2 
UA, x)| < ee 4 (1) 


when 0 < A < 1/2. By the Mean Value Theorem, there is a y € [Ax,x], such that 
(A, x)/x = y'(y). Now xg'(y) = (x/y)(cos y — g(y)), so 


2 
WA, <2-<2 <4 (2) 
y Xr 


when 5 < X < 1. Combining (1) and (2), we obtain M(A) < 4 for all A in (0, 1). 

To get a lower bound, observe that w(A, 7) = sin(Az)/ (AC — A)z) = w1 —A, 7), so 
it is enough to consider the case when A € (0, 1/2]. Since g(x) is decreasing on (0, 2/2], 
(A, ©) > (1/(1 — A)) (sin(r/2)/ (7 /2)) = 2/m. Therefore 2/7 < M(A) < 4. 


Editorial comment. For part (b), John H. Lindsey II and the GCHQ Problems Group in- 
dependently computed that the largest value for c is approximately 1.0631036 and the 
smallest value for C is approximately 1.3805662. Lindsey used Maple to search for these 
values and used estimates on derivatives to prove that the optimal value of C satisfies 
1.380566167 < C < 1.380577012. 


Part (a) also solved by R. J. Chapman (U.K.), T. Hermann, J. H. Lindsey II, GCHQ Problems Group (U.K.), and the proposer. 
A Convergent Series 


10657 [1998, 366]. Proposed by Jovan Vukmirovic, University of Belgrade, Belgrade, 
Yugoslavia. Let ¢ be a strictly increasing function from (0, oo) onto (0, oo). Prove that if 
°° 1/(nd7!(n)) converges, then )°°°. , 6(n)x" converges for |x| < 1. 


Solution by Kenneth Schilling, University of Michigan, Flint, MI. For x > 1, 
Lx] x 
1 1 1 
) -> | —dt = 5 In x. 


Thus, since ¢ is increasing, we have 


Inx EL od 
ae i ame SaaS 


By hypothesis, this expression converges to 0 as x —> oo. Hence lim; 9 In x / o'(x) =0. 
Since ¢(x) > co asx —> OW, we get limy+oIn@(x)/x = 0. Exponentiation yields 
limy—o0(@ (x))!/* = 1. Thus the power series converges for |x| < 1 by the root test. 

Solved also by S. Amghibech (France), J. Arregui, G. L. Body (U. K.), D. Borwein (Canada), D. Bradley (Canada), K. Dale & I. 


Skau (Norway), J. Fitch, K. Ford, G. L. Isaacs, P. Lang, J. H. Lindsey II, A. Stenger, T. V. Trif (Romania), T. Trimble, R. Vermes 
(Canada), C. Y. Yildirim (Turkey), GCHQ Problems Group (U. K.), and the proposer. 
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REVIEWS 


Edited by Harold P. Boas 
Mathematics Department, Texas A & M University, College Station, TX 77843-3368 


The French Mathematician. By Tom Petsinis. Walker and Company, New York, 1998, 
v +426 pp., $24. 


Reviewed by Tony Rothman 


It is not uncommon in our society for a celebrity to read a novel, recognize herself 
in one of the characters, and sue the author for defamation. The plaintiff’s position 
in such cases is surely peculiar: On the one hand she must argue that the offending 
character is real; on the other hand, she must argue that the character is not real 
enough. While reading Tom Petsinis’s new novel about Evariste Galois, one 
wonders throughout whether Galois would be pleased with this portrait or would 
call his lawyer. Galois is not forthcoming. As his advocate I would be tempted to 
advise him to call not a lawyer, I think, but a book doctor. However, this case is 
one for the jury to decide. 

Galois’s life, straddling as it did fact and fiction, naturally lends itself to 
novelization. Most mathematicians and scientists are familiar with the outlines of 
the story. A mathematical prodigy, Galois came of age in the aftermath of the 
Napoleonic empire. He twice failed the entrance examination to the prestigious 
Ecole Polytechnique, where he expected to study mathematics, and enrolled in the 
Ecole Normale, only to be expelled as a revolutionary firebrand. Arrested on two 
occasions for seditious activities in the wake of the 1830 revolution, he spent eight 
months in prison, was released in April 1832 on the advent of the great cholera 
epidemic, and a month later managed to get himself mortally wounded in a duel. 
The night before, he wrote out his scientific last will and testament, annotated his 
papers, and bequeathed his legacy—group theory—to the world. He was twenty 
years old. 

One of the great romances of science, Galois’s life has been the inspiration for a 
dozen novels, plays, and films. My own 1982 MONTHLY article [4] was actually the 
outcome of research for a play about Galois and the Russian poet Pushkin, whose 
life followed a similar trajectory. Petsinis has evidently followed the general outline 
of my article. Gone is the Galois of E. T. Bell’s Men of Mathematics [1], who is 
done in by the massed forces of stupidity arrayed against him and the shadowy 
political intrigues of Bell’s imagination. Gone also is Leopold Infeld’s Galois, hero 
of the proletariat. Vanished as well is Infeld’s pretense that his book [3] is actually 
a work of “faction,” a nonfiction novel. No, Petsinis has written a work of fiction, 
and he has no obvious ax to grind. He has attempted to portray a troubled youth 
living in turbulent times, a victim as much of his own personality as of external 
misfortunes. The French Mathematician aspires to be a believable portrait, both 
psychologically and historically. 

It is personally gratifying to see one’s work transmogrified and so transformed 
passed into the future, and as a rule one should review the work that has been 
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created, not the one that hasn’t been created. Nevertheless, my feeling on putting 
down The French Mathematician remains that Petsinis might have done better to 
pen a different book. I am not speaking of historical accuracy. There are many 
points one can quibble with—the book jacket copy gets the age of Galois at his 
death wrong, which does little to instill confidence; the ages of Galois’s instructors 
are incorrect, and so on. I may myself have misled Petsinis about Galois’s 
opponent in the celebrated duel. In my MONTHLY article I followed Alexandre 
Dumas, who stated that Galois’s opponent was Pescheux D’Herbinville, a fellow 
republican. D’Herbinville figures here as Galois’s adversary, though thankfully no 
dark plots, agents provocateur, or prostitutes are invoked, merely jealousy and 
honor. Well before the 1989 version of my article [5], however, I had been 
informed of the work of André Dalmas [2], who presents a clipping from a Lyon 
newspaper dated several days after the duel. The clipping identifies Galois’s 
opponent as L. D., initials that do not match any of Galois’s acquaintances. 
Nevertheless, judging from the article, the most probable opponent was Vincent 
Duchatelet, one of Galois’s best friends. The duel itself seems to have been a grisly 
version of Russian roulette: the adversaries had both fallen in love with the same 
girl, but “because of their old friendship they could not bear to look at each other 
and left the decision to blind fate. At point-blank range they were each armed with 
a pistol and fired. Only one pistol was charged” [5]. 

No, historical accuracy is not the issue; this is a novel. At issue is verisimilitude 
—Petsinis strives for it—and literary achievement. Increasingly I am convinced 
that God does not reside entirely in the details. To the contrary, the success of a 
work is largely determined by the basic decisions an author makes at the outset, 
most importantly: What is the book about? But also such large stylistic decisions 
as: Will the book be told in first or third person? Present or past tense? Petsinis 
has decided to tell the story from Galois’s own perspective. This decision results in 
a threefold hurdle that is almost impossible to surmount: He must get inside the 
head of an adolescent; he must get inside the head of an early 19th-century — 
adolescent; and he must get inside the head of an early 19th-century adolescent 
who happens to be a mathematical genius. 

Petsinis is either very brave or very foolhardy to have made the attempt, and if 
the effort has been only partly successful, that hardly comes as a surprise. To me 
what is most convincing are the frequent references early on to Pindar, Catullus, 
and Hugo, not to mention Archimedes, Euclid, and Pythagoras—classical litera- 
ture and mathematics that any child of Galois’s generation would have been 
immersed in. What is sometimes believable as well is Galois’s black-and-white 
outlook on things. During the first part of the novel, which takes place at the lycée, 
Evariste is only 15, and a black-and-white worldview comes, as they say, with the 
territory. 

But here the decision.to tell the story in the first person does not serve. Galois’s 
writings are hardly voluminous and he was certainly not a happy young man—the 
word “detest” leaps off his pages. Petsinis’s Galois detests everyone and everything 
—he says so often. Without any other substantive characters in the book to 
balance such pronouncements, Galois’s story quickly becomes a one-note perfor- 
mance. There is little sign of the great affection for his parents and relatives that 
Paul Dupuy, Galois’s original biographer, describes and that is evident in his 
letters. To be sure, a constant danger of a first-person narrative is that the 
subsidiary characters will fade into the background. That is precisely what happens 
here. Evariste’s mother, who was responsible for his early education, was by all 
accounts an intelligent, lively woman even into old age, and one who saw religion 


370 REVIEWS [April 


in the light of ethics. Petsinis has portrayed her as something of a religious fanatic. 
“Your father has turned from God, she glares. Paris! The new Babylon! The haunt 
of the Evil One! The source of atheism! I fear the Apocalypse is at hand! The signs 
are there: crime, debauchery, chaos! Don’t go back, Evariste. Stay here and help 
me lead your father back to God.” I am sure Petsinis chose this portrayal to 
prepare the way for her break with Evariste, which did take place when he was 
about 20. But it seems so unnecessary, and the dialogue ludicrous, when her son’s 
behavior is itself enough of an explanation. 

By the same token, Galois’s father, the liberal mayor of Bourg-la-Reine, is a 
mere cipher. Women other than Galois’s mother, to the extent that they figure at 
all in the book, are mostly prostitutes standing in doorways (Galois detests the 
thought of sex; only mathematics is pure), and the July revolution consists mostly 
of crowds chanting the usual slogans: “Freedom of the press!” “Liberté, égalité ....” 
Somewhat strangely, Galois himself appears as a reluctant revolutionary who turns 
from mathematics to politics only after his father commits suicide in the wake of a 
Jesuit plot against him. True, the choice allows for the character’s evolution, but it 
‘does contradict everything anyone has written about Galois—and the passion 
evident in his own writings. 

Which leads to the second hurdle Petsinis faces: is this a 19th- -century Galois? 
Partly. As I have said, the milieu Petsinis has created strikes one as credible. On 
the other hand, the sense of verisimilitude is occasionally shaken by some of 
Galois’s internal monologues, which sound surprisingly modern: 


If I focus on the point, the line proves an illusion. Astonishing that something 
so intangible should be the basis of all geometry! In a flash, I see the 
indivisible point as the seed of creation. Perhaps the universe exploded from 
the primal point. Perhaps God is the primal point. Perhaps the soul is 
nothing more than a point. 


Similar references to the fate of stars and to the nature of space and time do 
sometimes make Galois seem more like a 20th-century astrophysicist than a 
19th-century mathematician. The problem is again exacerbated by the first-person 
narrative. We are able to see Galois only as he sees himself, not as others see him. 
In this instance the effect is not to bring Galois alive. Here is Petsinis’s Galois 
describing the famous Preface to his work he wrote while in prison: 


I also managed to write a four-page preface to the book that would contain 
my collected work... denounced patronage and attacked the Academy for 
losing my manuscripts .... This was followed by a summary of the two papers 
J had reworked.... 

I then went on to denounce Poisson and the examiners at the Polytech- 
nique. Even though I had reason to believe the scientific fraternity would 
greet my work with a condescending smile, I persisted in trying to have my 
work published. ... 

Finally, I tackled the question of why readers found my work so difficult, 
even incomprehensible, and concluded it was due to my inclination to 
dispense with formalisms and calculations. 


Here is some of the original: 


I tell no one that I owe anything of value in my work to his advice or 
encouragement. I do not say so because it would be a lie. If I addressed 


1999] REVIEWS 371 


anything to the important men of science ... I swear it would not be thanks. I 
owe to important men the fact that the first of these pages is appearing so 
late. I owe to other important men the fact that the whole thing was written 
in prison, a place, you will agree, hardly suited for meditation, and where I 
have been dumbfounded at my own listlessness in keeping my mouth shut at 
my stupid, spiteful critics .... The whys and wherefores of my stay in prison 
have nothing to do with the subject at hand, but I must tell you how 
manuscripts go astray in the portfolios of the members of the Institute, 
although I cannot in truth conceive of such carelessness on the part of those 
who already have the death of Abel on their consciences. 


Is there any question which is the living Galois? 

One is left feeling that a more effective strategy might have been to surround 
Galois by a group of third-persons trying to make out this character, refractory by 
Petsinis’s own admission. Be that as it may, the sharpest issue raised by The French 
Mathematician centers on the portrayal of a mathematician. Art is of course 
enriched when it can find inspiration in science and mathematics. But is what we 
have here a convincing portrait? As a mere theoretical physicist, I have no special 
expertise. Nevertheless, I find it peculiar that any mathematician would be engag- 
ing in mathematical metaphors while dying, as Galois does: 


I embraced that fatal sphere with my whole body. Dreams, memories, even 
the mathematics I had cherished and set down in my last will and testament 
—all receded. I am reduced to a singular point; in an instant I am trans- 
formed to i. 


i = an imaginary being 


But Petsinis’s Galois compares and contrasts everything with mathematics, even his 
true love: “Our fingertips touch. J summon all my courage to look into her eyes. 
There I am, circumscribed by her pupils. And in that instant 7 reveals its perfect 
proportion and is reduced to a chaos of digits, the Republic is created and 
destroyed, I am extinguished and reborn.” 

Well, perhaps I’ve never known a mathematician of Galois’s caliber, but I find 
this pretty laughable. And this brings me to the main point. What The French 
Mathematician shares with its predecessors is the intention to portray a mathemati- 
cian as a thing apart. It is an intention that Hollywood would appreciate; its typical 
scientists are direct descendants of Frankenstein. Strange that both the scientific 
and nonscientific communities seem.bent on keeping scientists and nonscientists as 
distant from each other as possible. When Andrew Wiles announced his proof of 
Fermat’s last theorem, the New York Times portrayed him as a man who had 
locked himself in an attic for seven years. Having been a close friend of Wiles 
during much of that time, I can state categorically that he is a man of broad 
literary interests who was an avid film-goer and was not above going out for a beer 
on Saturday evenings; he could also find work as a romance counselor. 

One can’t help suspect that there is some deep-seated need on the part of both 
the scientific and the nonscientific communities to maintain their separation. The 
result is not only a great deal of misunderstanding about what science is, but also a 
real antipathy between scientists and others—especially between scientists and 
artists. The situation is analogous to existing divisions in the publishing domain: 
literature is divided into genres, and each genre must have standard devices. 
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(Suspense novels must contain the prerequisite amounts of sex, violence, and 
endings that result in an explosion, narrowly averted.) Editors tell you this. Any 
novel that doesn’t conform to these conventions is deemed unbelievable. And so 
the reality of the genre has replaced the reality of the real world. 

In some respects the relationship between the sciences and the humanities long 
ago fell into the pattern of genre fiction. This is unfortunate and does not 
adequately reflect the reality of the world. In fact, mathematics and science have 
influenced art more, perhaps far more, than is usually acknowledged. In the late 
19th century, speculation on the meaning of the “fourth dimension” was extremely 
popular and influenced the work of futurist and suprematist artists, who in turn 
influenced world architecture through Bauhaus. Einstein’s relativity prompted 
artists and musicians of the 1920s to speculate on the nature of space and time, 
which resulted in the machine esthetic. Marcel Duchamp’s famous “Large Glass” 
in Philadelphia was actually based on his musings about physics. Some historians 
argue that Girard Desargues invented projective geometry as a result of concern 
with perspective in art. 

And so on. That is the way civilization is created, not by streams running each in 
its own course, but by streams coursing together. I can’t help think that it is long 
past time on the part of both writers and scientists to emphasize their commonality 
of experience rather than their separateness of existence. 
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Social Constructivism as a Philosophy of Mathematics. By Paul Ernest. State University 


of New York Press, 1998, xiv +.315 pp., $19.95 softcover, $59.50 hardcover. 
What Is Mathematics, Really? By Reuben Hersh. Oxford University Press, 1997, 
xxiv + 343 pp., $35.00. 


Reviewed by Bonnie Gold 


In the early years of this century, Platonism (by which I mean the belief that 
mathematics is the science of certain mind-independent, non-physical objects with 
determinate properties) was dethroned as the dominant philosophy of mathemat- 
ics. Since then, there’s been a struggle to replace it with an alternative that avoids 
the philosophical problems of Platonism while accurately reflecting the working 
mathematician’s daily experiences of doing mathematical research. 
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None of Platonism’s immediate successors—logicism, formalism, intuitionism— 
has proved satisfactory. The first two fail to account for the role of the mathemati- 
cian in the establishment of mathematical knowledge, as if mathematical knowl- 
edge were possible without any mathematicians. They also don’t allow for the 
development of our knowledge of mathematics over time. The third, intuitionism, 
is unpopular because it rejects large parts of mathematics. As the philosophy of 
science started studying how scientific knowledge develops and the reasons for 
accepting new scientific theories, Lakatos and others began similar inquiries about 
mathematics. In the last 25 years, new candidates for philosophies of mathematics 
have become popular, including fictionalism, conventionalism, structuralism, and 
social constructivism. | 

In the books under review, both Paul Ernest and Reuben Hersh propose 
versions of social constructivism, which has been gaining adherents recently. 
However, their writing styles and viewpoints are completely different. Although 
each author speaks warmly of the other, they represent opposite extremes of the 
school. Mathematicians will almost universally find Hersh’s version palatable and 
Ernest’s unpleasant to read and at odds with actual mathematical practice. 

Part of the problem with Ernest’s book is really the fault of mathematicians, 
especially teachers of logic. Philosophy students often take a required mathematics 
course without really understanding it—they never internalize DeMorgan’s laws 
(the negation of “p and q” is “not p or not q”), and they don’t appreciate the 
importance of correct hypotheses in theorems—but they work so hard that we give 
them a passing grade. Alas, some of them go on to graduate school and beyond, 
and start writing books about the philosophy of mathematics that make us shudder 
with embarrassment as they betray their total lack of understanding of mathemat- 
ics. 

Ernest is a philosopher who thinks he knows about mathematics because he’s 
taken a few courses in mathematical logic. He hasn’t understood them. He bases a 
part of his philosophical position on an incorrect statement of the Craig Interpola- » 
tion Lemma. The lemma says that if one has two formulas A and B, possibly 
involving different symbols, and a proof that A — B, then there is a formula X 
involving only symbols that occur in both A and B such that there are proofs that 
A-—X and X — B. That is, if one formula implies another, then there is an 
interpolant in a language that they have in common. Ernest first states the lemma 
incorrectly (p. 204, footnote 13), saying that X “includes the mathematical con- 
cepts occurring in both A and B”—that is, it might be in a larger language! Then, 
he uses this bizarre misstatement of the theorem to claim that “no step from A to 
B in a proof is above further analysis, and there are no ultimate basic proof steps 
into which a published mathematical theorem can be analyzed.” Yet, once one has 
interpolated an X that is in the common language, there’s nothing further to be 
done! Certainly one can trivially add X arbitrarily often as an interpolant: A — X, 
X — X, X — X, and X — B; but this adds nothing, whereas the original act of 
interpolation in some sense gets at what A and B have in common that makes A 
imply B. After that, there is nothing further to analyze. 

This confusion is not a central point in Ernest’s argument (although he does 
refer to it twice, using it to claim that it’s not possible, even in principle, to reduce 
all proofs to a standard format), but it is typical of the lack of understanding of 
mathematics found throughout the book. He gives few concrete mathematical 
examples of what he is talking about, and when he does, frequently they are either 
incorrect or a misinterpretation of the mathematical result. 
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The audience for the book is clearly professional philosophers of mathematics, 
not mathematicians. Since the book is full of terms such as “reification” and 
“hermeneutic” and references to philosophers such as Gadamer, Bakhtin, and 
Collingwood, mathematicians will find it extremely tough slogging unless they have 
a good knowledge of philosophy and have closely followed the writings in philoso- 
phy of mathematics over the last 30 years. 

The philosophical point of view Ernest puts forth is that mathematics is simply 
whatever the community of mathematicians chooses to call “mathematics”, and 
mathematical truths are simply what we decide to baptize as truths. If tomorrow 
we decide that Z, has a subgroup of order 5, then it will, and that will be as 
“objectively” true as the fact, today, that it doesn’t. Recognizing that he has to 
account for the apparent “objectivity” of mathematics, Ernest simply uses Orwell’s 
Newspeak as a model and says that if a community agrees something is true, that’s 
“objectivity.”” He accounts for the universal agreement on the facts of mathematics 
by the observation that we bully our students in grade school to accept such 
“conventions” as “2 + 2 = 4,” so that by the time they get to college they view it 
as a law of nature. 

The basis of Ernest’s views is what he sees as extensions of the work of 
Wittgenstein and Lakatos. From Wittgenstein he takes the view that mathematics 
is a verbal game, played by rules. The rules are looked after by the community of 
mathematicians, and we accept new members into our community when they show 
that they can follow those rules. What the rules are, he declines to specify, 
although they include “proof,” which he views as a kind of conversation (again, 
unspecified, but varying over time) to “warrant” mathematics (a philosophical 
term roughly meaning to give a basis for something to count as an object of 
knowledge). Proof is by no means the only rule of the game: there are rules for 
discovery, for writing or discussing mathematics, etc., but no details beyond general 
principles are given. There is nothing even conceivably fixed in any of these rules 
—rather, Ernest insists that NO text can have a unique meaning, but simply has 
the meaning the community chooses to give it at present. As has become popular 
among philosophers of mathematics, he cites the Lowenheim-Skolem Theorem in 
support of this view, although this theorem is about first-order logic only, not all of 
mathematics. 

From Lakatos, Ernest takes the view that the philosophy of mathematics must 
be considered from a historical perspective, and revolutions in mathematics are a 
regular part of mathematical practice. Ernest takes this to mean that no facts are 
permanently true; “facts” true today may become false tomorrow. This is simply 
false. There are changes in the meanings of some words and different levels of 
rigor over time, so our philosophical interpretations of the mathematics change. 
But the facts themselves simply don’t change. Although other sciences and philo- 
sophical theories change their “facts” frequently, 2 + 2 remains 4. 

Hersh’s book is an attempt to set forth in greater detail the philosophy 
mentioned cursorily in his books written with Philip J. Davis, The Mathematical 
Experience and Descartes’ Dream. These books have done a substantial service to 
the mathematical community, and to the world at large, by attempting to bridge 
the gap between mathematicians and everyone else. There is far too little of this 
kind of expository writing by mathematicians. Hersh targets the same wide audi- 
ence in this new book, but in this case he serves no one well by the ambiguity of 
audience. There are too many references to specific mathematical facts for anyone 
without an undergraduate degree in mathematics fully to understand the book 
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(despite a final section that briefly explains various mathematical topics). On the 
other hand, the lack of specific references for his myriad quotes and paraphrasings 
makes it a Herculean task for the mathematician or philosopher to examine the 
contexts of the quotes. 

The title “What is Mathematics, Really” is in response to “What is Mathemat- 
ics” by Courant and Robbins [1], which “answered” the question by providing some 
very nice examples of mathematics, but which never summed up these examples in 
a clear, concise statement. Hersh’s book, an attempt to provide that statement, 
consists of three parts. The first part sets forth his philosophy of mathematics in a 
brief, well-written 90 pages. While I don’t believe that this is the correct view of 
mathematics, Hersh makes the case well. He has the best account (pp. 61-62) I’ve 
seen of the role of intuition in mathematics: the various ways mathematicians use 
the word and its importance in both the development of new mathematics and 
decisions about the correctness of this new mathematics. It’s worth getting the 
book simply to read this discussion. The second part is his particular take on the 
history of the philosophy of mathematics, in which he divides philosophers of 
mathematics into mainstream or humanist. I’m not sufficiently well-versed in this 
history to judge the accuracy of his division; the lack of detailed references is 
especially annoying in this part of the book. His classification of Brouwer with the 
traditionalists is particularly bizarre. Any reasonable understanding of intuitionism 
would place Brouwer as a forerunner of the social constructivist school: for 
intuitionists, mathematics is that which the community of mathematicians con- 
structs; mathematical objects don’t exist until constructed by mathematicians. The 
final part of the book begins with an extremely offensive classification of philoso- 
phers of mathematics into leftists and rightists, resulting in a body count of far 
more leftists on the humanist side and more rightists on the traditional side. (Of 
course, moving Brouwer to the correct side of the traditionalist /humanist contro- 
versy makes the count less one-sided.) I see it as a desperate argument to sway 
mathematicians (who mostly tend to be generous souls) toward his philosophy. 
This is followed by 65 pages of mathematical notes and comments, an attempt to 
make the various mathematical references through the body of the text accessible 
to the lay reader. 

Thus, the main new contribution of this book is in its first 90 pages. Hersh 
begins by discussing the problems of Platonism. The traditional Platonist view is 
that “mathematical entities exist outside space and time, outside thought and 
matter, in an abstract realm independent of any consciousness, individual or social” 
(p. 9). The traditional philosophical difficulties with this view are (1) it requires a 
belief in some abstract, non-physical, non-psychological realm, which might have 
been fine when God was central.to our world-view, but which is unattractive to 
modern intellectuals, and (2) even if such a realm exists, how do we, physical 
beings, have any contact with, or knowledge of, this realm? To the best of my 
knowledge, there haven’t been any serious recent attempts by mathematicians to 
modify Platonism to meet those objections. This leaves the field open for philoso- 
phers to declare victory over Platonism, and to offer alternative philosophical 
descriptions of mathematics. 

It’s not clear that any modern mathematicians who view themselves as Platonists 
(or “realists”) actually subscribe to Hersh’s full statement. Most mathematicians 
are not particularly committed to where or what mathematical objects are, just as 
long as they do have objective properties. Therefore Hersh’s view is attractive: 
mathematical objects are constructed by the community of mathematicians. Gener- 
ally a new concept is suggested by one mathematician, but often the idea is 
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developed and modified over a period of time by the community until it settles 
down to have a fixed definition. 

Mathematical objects, then, are neither mental nor physical; rather, they are 
social entities, in the same general category as monetary systems, or the Supreme 
Court (not the people, but the institution). So far, Hersh and Ernest are in rough 
agreement. For Hersh, however, once a mathematical object has been constructed 
by the community of mathematicians, it takes on a sort of life of its own. Its 
properties are no longer dependent on people, but are discovered; some proper- 
ties, not apparent when we first define the object, can be difficult to discover. This 
is the point at which Hersh and Ernest part company. For Ernest, only the current 
rules of the mathematical game make x” an even function, and it could become 
odd at any time. For Hersh, once the function has been defined, mathematicians 
no longer control it. 

Hersh’s version has several very attractive features. It fits fairly well with 
mathematics as done by mathematicians. Our knowledge of mathematics develops 
over time, as well it should if mathematicians invent new mathematical objects or 
discover connections among those already invented. Mathematics is no longer 
infallible, but that’s much more consistent with our actual experience than is the 
purported infallibility of traditional Platonism. After all, false proofs are often 
published, and mistakes are common when one works on new mathematics. Under 
Hersh’s view, the mathematical community determines what constitutes a proof, 
and proof becomes the standard set by the community for acceptance of new 
mathematical knowledge. This describes actual mathematical practice far better 
than formalism’s view that proofs are deductions in first-order logic, of which no 
human can comprehend any non-trivial examples. Further, there’s a role for 
mathematics education: it brings new members into the community. 

However, there are two important ways in which Hersh’s social constructivism is 
inadequate as a philosophy of mathematics. The first is that it doesn’t account well 
for the usefulness of mathematics in the world. Why should mathematics devel- 
oped before any application was known turn out to be useful, and often in a variety 
of unrelated contexts? Certainly social constructivism can explain the parts of 
mathematics developed in response to some societal need. But the majority of 
applications, especially in this century, came from mathematics developed for the 
pure interest of the mathematical question—so why should it later be found to 
have anything to do with the real world? 

The second, and, I think, the more serious problem with social constructivism is 
that it doesn’t distinguish between the facts of mathematics (the function x? is 
even, the group Z, has subgroups of order 1, 2, 3, and 6) and our knowledge of that 
mathematics. Plato’s bizarre suggestions on recollection from past lives aside, it’s 
clear that as children we are taught mathematics; the mathematical community’s 
knowledge of mathematics develops over time; and it involves a communal effort. 
That social constructivist philosophies do recognize this development, and credit it, 
is a principal reason they are attractive. But this recognition is not an argument 
against the independence of mathematical objects from human society. Our 
knowledge of physics also develops over time. Nonetheless, the facts of physics 
exist before their discovery, and they are independent of errors the community of 
physicists makes or debates within that community. Furthermore, the concepts of 
physics—atoms, velocity, force, and so on—are developed gradually by the com- 
munity of physicists. Yet physical objects either are or are not made up of atoms, 
and it’s not the community of physicists that makes that true or false: it’s the actual 
state of the world. That is, people develop and clarify concepts, but the objects 
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those concepts attempt to describe either are or are not present in the world. The 
same is true of the objects of mathematics. Symmetry properties of chemicals 
affect how those chemicals behave, and did so even before mathematicians 
discovered symmetry groups. Ropes didn’t start hanging in catenaries only after 
mathematicians “invented” that curve. 

Hersh consistently ignores this distinction between the facts of mathematics and 
our knowledge of those facts, and many of his statements become obviously false if 
one reads them with this distinction in mind. “From living experience we know two 
facts: Fact 1: Mathematical objects are created by humans...” (p. 16). In what way 
does our “creation” of Z, differ from our creation of quarks? Clearly, we single 
out the concept as something worth applying to part of our experience, but that 
isn’t creating the object. “Point 1 is that mathematics is a social-historic reality. 
This is not controversial” (p. 23). No, it’s not controversial, just false. Our 
knowledge of mathematics is a social-historic reality, though, and that isn’t contro- 
versial. 

There are several other problems with Hersh’s account of mathematics. 
“A realistic analysis of mathematical intuition should be a central goal of the 
philosophy of mathematics” (p. 62). Hersh asserts that our intuition of mathemati- 
cal objects comes from our education, from doing problems and getting checked 
for correctness by our teachers. We check that we have the same representation of 
a concept by seeing if we give the same answers to questions about it. That is, it’s a 
social activity. But this doesn’t square with experiences such as that of Ramanujan, 
who developed his results in complete isolation. Certainly, without a community of 
mathematicians—and especially without Hardy—to recognize the importance of 
his work, it would have been lost to the world. But Ramanujan’s intuition came 
from working with the mathematical objects themselves, not from the community. 
While Ramanujan’s case is certainly the extreme, the history of mathematics has 
numerous other examples of mathematicians (Desargues and Abel come to mind) 
who were so far ahead of their time that there was no community developing the 
intuition behind their discoveries. 

In the past, a requirement of a philosophy of mathematics has been that it 
account for the peculiar level of certainty we appear to have in mathematics. The 
three philosophical schools that developed around the turn of the century— 
logicism, intuitionism, and formalism—were the result of trying to regain the level 
of certainty for which mathematics was notorious before the discovery of non- 
Euclidean geometry, contradictions in set theory, and problems in the foundations 
of analysis. Hersh spends a lot of time trying to debunk the notion of “mathemati- 
cal certainty” on the grounds that formal proofs don’t give certainty, because 
nontrivial ones aren’t surveyable, and whence does our certainty come, if not 
through formal proofs? Yet once the definitions are understood, any two mathe- 
maticians, no matter what their cultures, should come to the same conclusions. 
That is different even from other sciences, not to mention studies (e.g., economics, 
politics) of the “socio-historical objects” Hersh would place mathematics among. 
Even if proofs don’t give absolute certainty, proofs are central to the establishment 
of a mathematical result. ’m happy to view proofs among the activities that 
concern human knowledge of mathematics, not the mathematics itself. But nothing 
in Hersh’s account explains the special role of proof in mathematics—why it’s so 
central for establishing the truth of mathematical facts. An adequate philosophy of 
mathematics must do a better job of incorporating the role of proof, beyond 
explaining it as a peculiar social custom. 
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I’m a mathematician. I work every day with such mathematical objects as the 
function x* and the group (or ring) Z,. I’m not really sure where or what these 
objects are. But I am sure that nothing the human community (mathematicians or 
otherwise) does will make x* into an odd function or make Z, have a subgroup of 
order 5. And any philosophy of mathematics that says otherwise must simply be 
rejected. 

What is needed is an account of mathematical objects, and facts of mathemat- 
ics, and our knowledge of these facts, that parallels the account of physical objects 
and facts of physics and our knowledge of them. This may be roughly what Godel 
meant when he wrote, “Despite their remoteness from sense experience, we do 
have something like a perception also of the objects of set theory, as is seen from 
the fact that the axioms force themselves upon us as being true.... This, too, may 
represent an aspect of objective reality” [2]. There’s no reason to believe, as Hersh 
asserts, that Godel believed this perception true only of sets and not of other 
mathematical objects. 

There is one alternative philosophy of mathematics that rivals social construc- 
tivism in growing popularity, but that Hersh touches on only briefly and trivially 
dismisses. It is the closest to a current attempt to account for mathematics in a 
Platonist manner while meeting the philosophical objections to older Platonist 
accounts. This is the view of mathematics as “the science of pattern.” While many 
mathematicians have supported this view (Lynn Steen and Peter Hilton among 
them), I’m unaware of any attempt on the level of the two books under review to 
present this view carefully and in detail. Hersh’s objection that mathematics 
doesn’t study every kind of pattern is trivially correct. That is why this view needs 
further exploration. However, the view that mathematics is the science of certain 
kinds of patterns allows mathematics to be independent of society and yet accounts 
for the ability of physical beings to contact objects in the world of mathematics. 

One thing human beings do extremely well is to observe patterns. We’re far 
superior to computers in this ability. With great difficulty we can program comput- 
ers to recognize very simple patterns, but human beings recognize some patterns 
when we are born and very quickly learn to recognize and work with many kinds of 
patterns. Furthermore, patterns partake of both the physical and the non-physical. 
In fact, we can see our ability to classify anything as a recognition of patterns. One 
collection of atoms, of leaves and branches in certain patterns, is a tree. We 
recognize the general pattern by giving it this collective name. Another pattern is a 
table. Some patterns are man-made, others are out there in the world, others are 
abstract, but our minds are what give them a collective name. To the extent that 
we Say that trees are objective and not dependent on the human mind, we can start 
attributing that same objectivity to the patterns mathematicians study. It’s certainly 
not the case that all patterns are mathematics. But at least somewhere in that 
region there seems a place to carve out the realm of mathematics—neither 
physical, mental, nor social, but certainly associated with the world, independent of 
us, and accessible to human beings. 

Such a view does a better job of accounting for the applicability of mathematics, 
for many of our patterns come from the physical world. That our knowledge of 
mathematics develops over time becomes no more surprising than that our 
knowledge of physics develops over time. That different cultures have been 
especially hospitable to the discovery of certain parts of our mathematical knowl- 
edge is almost inevitable, as the patterns that a culture surrounds itself with are 
particular to it—and so our mathematical knowledge is very much a socio-cultural 
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artifact. How proof helps guarantee mathematical truth remains to be explored, 
but it’s reasonable that a link can be found in the connection between patterns in 
the world and mathematicians looking for them. 

Although in the end I don’t believe Hersh has yet shown us what is mathematics, 
really, he has made an important contribution to the discussion. Any acceptable 
philosophy of mathematics must be consistent with actual practice of mathemati- 
cians—that we make errors, that our proofs are not exercises in formal logic, that 
our knowledge changes over time. Whatever we finally decide mathematics is, it is 
still discovered by humans; what is discovered may depend on social or cultural 
factors; and the discovering, teaching, and sharing of our knowledge of mathemat- 
ics remains something to be shared by all, both for the benefit of the growth of that 
knowledge and for the human race’s ability to rise above petty fraternal feuds. 
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| Today Devin catches me at it. 

| “Mommy, what’re you trying to do?”’ 
“Oh,” I say, “well, these lines. 
I’m trying to fix these lines.” 
And then I explain triangles. 
And then I explain transitive. 

| “Sometimes it can’t be done,’ I tell him, “‘and other times it can. 
I’m trying to figure out when it can.” | 
‘“T get it,” he says. “I get it, Mommy.” 

: And later he catches me at it again. 

| ‘That one worked, right?” 


‘Cause maybe, if that one worked, we can go play Parchesi. 
Or cards. Or ice cream. Or hanging out. 

Or at least Mommy won’t 

keep staring at those lines. 


Contributed by Marion Cohen, Drexel University, Philadelphia, PA 
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plex Function Theory. Frank Smithies. Cam- 
bridge Univ Pr, 1997, 216 pp, $59.95. [ISBN 
0-521-59278-X] Careful, detailed analysis of 
Cauchy’s contributions to complex analysis 
during the period 1814-1831 including work 
on the Cauchy—Riemann equations, Cauchy’s 
Theorem, residue theory, contour integration, 
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cians: Mathematics in Sweden before 1950. 
Lars Garding. History of Math., V. 13. AMS, 
1998, xiii + 288 pp, $75. [ISBN 0-8128-0612- 
2] Comprehensive description beginning in 
the 1700’s. Clear exposition of the many math- 
ematical contributions of the Swedes as well as 
glimpses into their personal lives. Includes a 
short biography of one of the most influential 
Swedish mathematicians, Gdsta Mittag—Leffler, 
who founded Acta Mathematica and brought 
Sonya Kovalevski to Sweden. BH 
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Logic, C, P. Consequences of the Axiom of 
Choice. Paul Howard, Jean E. Rubin. Math. 
Surv. & Mono., V. 59. AMS, 1998, vii + 
432 pp, $89, with disk. [ISBN 0-8218-0977- 
6] . Comprehensive list of statements proved 
using the Axiom of Choice. Includes proofs 
not readily available in the literature; gives ref- 
erences for others and for relationships between 
consequences. Describes models of set theory. 
Accompanying software provides access to in- 
formation on a single consequence or a pair of 
consequences. KES 
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Combinatorics, P. Steiner Minimal Trees. Di- 
etmar Cieslik. 
Applic., V. 23. Kluwer Academic, 1998, 
xi + 319 pp, $140. [ISBN 0-7923-4983- 
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Kluwer Academic, 1998, xxvii + 336 pp, $162. 
[ISBN 0-7923-5 109-6] 


Analysis, P. Applied Analysis. Eds: J. Robert 
Dorroh, et al. Contemp. Math., V..221. AMS, 
1999, ix + 284 pp, $59 (P). [ISBN 0-8218- 
0673-4] Papers from special sessions at the 
1996 Southeastern Regional Meeting of the 
AMS held in Baton Rouge, Louisiana. 


Analysis, P. Existence Theory for Nonlinear In- 
tegral and Integrodifferential Equations. Donal 
O’Regan, Maria Meehan. Math. & Its Applic., 
V. 445. Kluwer Academic, 1998, 1x + 218 pp, 
$102. [ISBN 0-7923-5089-8] 12 surveys of 
recent results in Fredholm and Volterra inte- 
gral and integrodifferential equations, resonant 
and nonresonant problems, integral inclusions, 
stochastic equations, and periodic problems. 


Analysis, P. Proceedings of the St. Petersburg 
Mathematical Society, Volume IV. Ed: O.A. La- 
dyzhenskaya. AMS Transl., Ser. 2, V. 188. 
AMS, 1999, vii + 250 pp, $115. [ISBN 0- 
8218-0613-0] 11 papers on topics in analysis, 
probability, and applications. | 
Algebraic Geometry, T(18), P. Moduli of 
Curves. Joe Harris, Ian Morrison. Grad. Texts 
in Math., V. 187. Springer-Verlag, 1998, xiii 
+ 366 pp, $59. [ISBN 0-387-98438-0] A 
guide to algebraic curves and how they vary 
in families. Emphasizes techniques and appli- 
cations rather than reference material. Basic 
background in algebraic geometry necessary; 
more would be helpful. JD 

Algebraic Geometry, P. Representation Theo- 
ries and Algebraic Geometry. Eds: A. Broer. 
NATO ASI Ser. C, V. 514. Kluwer Academic, 
1998, xxii +443 pp, $205. [ISBN 0-7923-5193- 
2] Proceedings of a 1997 NATO Advanced 
Study Institute at the Université de Montréal. 


Algebraic Geometry, P. Vector Bundles and 
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Their Applications. Glenys Luke, Alexander 
S. Mishchenko. Math. & Its Applic., V. 447. 
Kluwer Academic, 1998, viii + 254 pp, $130. 
[ISBN 0-7923-5 154-1] 


Geometry, P. Hypergeometric Functions, My 
Love: Modular Interpretations of Configuration 
Spaces. Masaaki Yoshida. Aspects of Math., 
V. E32. Friedr. Vieweg & Sohn, 1997, xvi + 
292 pp, $64. [ISBN 3-528-06925-2] Exuber- 
ant account of the modular interpretation of the 
configuration space of six lines on the projective 
plane. DB , 


Topology, P. Jame Topology and O-minimal 
Structures. Lou van den Dries. London Math. 
Soc. Lect. Note Ser., V. 248. Cambridge Univ 
Pr, 1998, x + 180 pp, $39.95 (P). [ISBN 0- 
521-59838-9] Self-contained treatment of O- 
minimal structures from geometric and topo- 
logical viewpoints. Assumes only basics of al- 
gebra, topology, analysis. Of interest to model- 
theorists, geometers, and topologists. JD 


Optimization, P. Nonsmooth Approach to Op- 
timization Problems with Equilibrium Con- 
straints: Theory, Applications and Numerical 
Results. Jiti Outrata, Michal Koévara, Jochem 
Zowe. Nonconvex Optimiz. & Its Applic., 
V. 28. Kluwer Academic, 1998, xxi + 273 pp, 
$129. [ISBN 0-7923-5170-3] 


Optimization, P. Generalized Convexity, Gen- 
eralized Monotonicity: Recent Results. Eds: 
Jean-Pierre Crouzeix, Juan-Enrique Martinez- 
Legaz, Michel Volle. Nonconvex Optimiz. & 
Its Applic., V. 27. Kluwer Academic, 1998, 
xv + 467 pp, $208. [ISBN 0-7923-5088- 
X] 23 lectures from the Fifth Symposium on 
Generalized Convexity held in 1996 at CIRM, 
Luminy, France. Papers organized into four 
sections: Generalized Convexity; Generalized 
Monotonicity; Optimality Conditions and Du- 
ality; Vectorial Optimization. Each section con- 
tains at least one tutorial. 


Elementary Statistics, T**(1), C. Workshop 
Statistics: Discovery with Data and Minitab. 
Allan J. Rossman, Beth L. Chance. Springer- 
Verlag, 1998, xxviii + 486 pp, $39.95 (P). 
[ISBN 0-387-98411-9] New version of Work- 
shop Statistics: Discovery with Data (Springer- 
Verlag, 1996), the essential resource for 
activities-based statistics instruction. New ver- 
sion includes Minitab commands and macros, 
updated data sets. Extensive and helpful cur- 
ricular support (e.g., data sets, Minitab work- 
sheets) available on the internet. RL 


Mathematical Statistics, T(16-17). Bayesian 
Statistics: An Introduction, Second Edition. Pe- 
ter M. Lee. Wiley, 1997, xii + 344 pp, 
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$39.95 (P). [ISBN 0-471-19481-6] Expands 
First Edition by adding two chapters and in- 
creasing the number of exercises considerably. 
Solutions to exercises available on the web. HS 


Statistical Methods, T(16: 1), C. Statisti- 


cal Case Studies: A Collaboration Between 
Academe and Industry. Roxy Peck, Larry D. 


Haugh, Arnold Goodman. ASA—SIAM Ser. on - 


Stat. & Appl. Prob. SIAM, 1998, xxxi+ 282 pp, 
$26 (P), with disks. [ISBN 0-89871-413-3] 
20 case studies using data sets from business, 
industry, or government. Data sets have not 
been simplified for classroom use. Cases are 
ordered by level of sophistication; most are at 
an undergraduate level. No exercises. HS 


Statistical Methods, S(16). An Introduction to 
Statistical Modelling. Wojtek J. Krzanowski. 
Oxford Univ Pr, 1998, x + 252 pp, $19.99 (P). 
{ISBN 0-340-69185-9] Reference book with 
an applied slant. Covers core issues of classical 
Statistical modeling. Focus is on the statistical 
aspects; only the most essential mathematical 
justifications are given. HS 


Statistical Methods, T(17: 1), P, L. Nonre- 
sponse in Household Interview Surveys. Robert 
M. Groves, Mick P. Couper. Ser. in Prob. & Stat. 
Wiley, 1998, xiii + 344 pp, $79.75. [ISBN 0- 
471-18245-1] Discusses statistical impact of 
nonresponse on survey statistics, survey design 
features that might affect participation, the na- 
ture of interviewer-householder interaction pro- 
ducing the participation, and post-survey ad- 
justments to correct for nonresponse. Assumes 
an initial undergraduate survey methods course. 
No exercises. KB 


Statistical Methods, T(17-18, 1, 2), P, L. Re- 
gression Analysis of Count Data. A. Colin 
Cameron, Pravin K. Trivedi. Econometric Soc. 
Mono. Cambridge Univ Pr, 1998, xvii+411 pp, 
$64.95; $24.95 (P). [ISBN 0-521-63201-3; 0- 
521-63567-5] An integrative survey of liter- 
ature on count data regressions. Comprehen- 
sive and up-to-date. Makes sophisticated meth- 
ods accessible to practitioners of different back- 
- grounds. Emphasizes issues arising in econo- 
metric applications. KB . 


Statistical Methods, T(17: 1), P. Applied Cat- 
egorical Data Analysis. Chap T. Le. Ser. in 
Prob. & Stat. Texts & Ref. Sec. Wiley, 1998, xiii 
+ 287 pp, $64.95 (P). [ISBN 0-471-24060-5] 
For students and research workers in biomed- 
ical fields. Application oriented; mathematics 
kept to a minimum. Many real-life examples 
and computer program samples. HS 


Statistical Methods, T(13: 1). Statistical 
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Quality Control: Strategies and Tools for Con- 
tinual Improvement. Johannes Ledolter, Claude 
W. Burrill. Wiley, 1999, x + 526 pp, $86.95. 
[ISBN 0-471-18378-4] Thorough coverage, 
with emphasis on understanding concepts and 
practical use. Covers nonstatistical problem 
solving tools and design of experiments for pro- 
cess improvement including Taguchi methods. 
Large collection of exercises and projects. Re- 
quires no statistical knowledge. KB 


Applications (Quantum Theory), P. Symme- 
tries in Science X. Eds: Bruno Gruber, Michael 
Ramek. Plenum Pr, 1998, ix + 454 pp, $139.50. 
[ISBN 0-306-45908-6] Proceedings of a 1997 
symposium held at the Collegium Mehrerau in 
Bregenz, Austria. 


Applications (Systems Theory), P. Time- 
Varying Systems and Computations. Patrick 
Dewilde, Alle-Jan van der Veen. Kluwer Aca- 
demic, 1998, xiii + 459 pp, $122. [ISBN 0- 
7923-8189-0] | 
Applications (Systems Theory), T(16—17). 
Mathematical Control Theory: Deterministic 
Finite Dimensional Systems, Second Edition. 
Eduardo D. Sontag. Texts in Appl. Math., V. 6. 
Springer-Verlag, 1998, xvi + 531 pp, $49.95. 
[ISBN 0-387-98489-5] New material on non- 
linear controllability via Lie-algebraic meth- 
ods, variational and numerical approaches to 
nonlinear control, time-optimal control of lin- 
ear systems, feedback linearization, nonlinear 
optimal feedback, controllability of neural net- 
works, and controllability of linear systems with 
bounded controls. (First Edition, TR, February 
1991.) 


Applications, P. Mathematical Morphology 
and its Applications to Image and Signal Pro- 
cessing. Eds: Henk J.A.M. Heijmans, Jos 
B.T.M. Roerdink. Computat. Imaging & Vi- 
sion, V. 12. Kluwer Academic, 1998, ix + 
442 pp, $195. [ISBN 0-7923-5133-9]  Pro- 
ceedings of a 1998 symposium held in Amster- 
dam, The Netherlands. 5 invited and 46 con- 
tributed papers in 8 sections: Theory; Topol- 
ogy and Geometry; Shape Analysis and Partial 
Differential Equations; Connected Operators; 
Segmentation; Random Models and Statistical 
Techniques; Algorithms; Applications. 


Reviewers 


KB: Karla Ballman, Macalester; DB: David Bressoud, 
Macalester; JD: Jill Dietz, St. Olaf; BH: Bruce Hanson, 
St. Olaf; LCL: Loren C. Larson, St. Olaf; RL: Rob Leduc, 
Macalester; KS: Karen Saxe, Macalester; HS: Heidi Shier- 
holz, St. Olaf; KES: Kay E. Smith, St. Olaf; PZ: Paul Zorn, 
St. Olaf. 
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Writing in the Teaching and Learning of Mathematics 
discusses both how to create effective writing assign- 
ments for mathematics classes and why instructors 
ought to consider using such assignments. The book 
is more than just a user’s manual for what some have 
termed “writing to learn mathematics,” it is an argu- 
ment for engaging students in a dialogue about the 
mathematics they are trying to learn. 


The first section, “First Steps,” contains chapters _ 
addressing the nuts and bolts of how to design and 
evaluate writing assignments. The second section, 
“Listening to Others,” introduces ideas such as 
audience, narrative, prewriting and process writing, 
which our colleagues in writing departments have 
found useful. Specific examples illustrate how these 
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are important for writing in mathematics classes. 
After the third section, “Major Projects,” the text 
concludes with “Narrating Mathematics,” a section 
making explicit what is implicit in the rest of the text: 
writing, speaking and thinking are all intertwined. 
By asking good questions and critiquing students’ 
manuscripts in an open, yet rigorous manner, 
instructors can get students at any level of ability 
and background to a deeper awareness of the beauty 
and power of mathematics. 
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Calculus projects you can give to your students. . . 


This book consists of eleven mathematics projects 
based on introductory single-variable calculus, 
together with some guidance on how to make use 
of them. Each project is presented as an amusing 
short story. In many of them a group of undergrad- 
uate mathematics students, formed into a consult- 
ing company called Math Iz Us, is hired to solve 
mathematical problems brought to them by clients. 
The problems solved include: helping to prosecute 
an accused pool shark, defending a driver accused 
of speeding, assisting a hockey coach in making his 
star forward a more effective goal scorer, and advis- 
ing a pirate captain on how to divide a gold-plated 
goose-egg fairly among his crew. 


In each problem, the problem solvers are required 
to present to their client a detailed written report of 
their findings. Thus, students must produce and 
analyze accurate mathematical models of complex, 
verbally presented “real life” situations, and write a 
clear technical account of their solution. 


Phone in Your Order Now! ®@ 1-800-331-1622 


Instructors who are looking for problems that are 
novel, interesting, and several levels more complex 
than the typical text book “word problem” will find 
them in this book. It will be of particular value to 
instructors who wish to combine training in appli- 
cations of calculus with training in technical writ- 
ing. The complexity of the problems makes them 
suitable for use as group projects. 


The calculus concepts on which the problems are 
based include: tangent and normal lines, optimiza- 
tion by use of critical points, inverse trig functions, 
volumes of solids, surface area integrals, and model- 
ing economic concepts using definite integrals. 
Although a few ideas from physics and economics 
are used in the problems, no prior knowledge of 
these fields is required. 
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Packaged with a PC compatible disk that enhances the 
material in the text. 


Suitable for classroom adoption in an innovative course for 
¢ a general education mathematics elective 
e amathematics or science major advanced elective 
e an interdisciplinary course, even at a relatively 
elementary level 
¢ a mathematical modeling course ina 
civil/environmental engineering program 


This book has a dual objective: first, to introduce the reader 
to some of the most important and widespread environmen- 
tal issues of the day; and second, to illustrate the vital role 
played by mathematical models in investigating these issues. 
The environmental issues addressed include: ground-water 
contamination, air pollution, and hazardous material emer- 
gencies. These issues are presented in their full real-world 
context, not as scientific or mathematical abstractions; and 
for background, readers are invited to investigate their status 
in their own communities. 

The first part of the book leads the reader through rela- 
tively elementary modeling of these phenomena, including 
simple algebraic equations for ground water, slightly more 
complex algebraic equations (preferably implemented on a 
spreadsheet or other computerized framework) for air pollu- 
tion, and a fully computerized modeling package for haz- 
ardous materials incident analysis. The interplay between 
physical intuition and mathematical analysis is emphasized. 

For more advanced readers, the second part of the book 
returns to the same three subjects but with a higher level of 
mathematical sophistication (adjustable to the preparation of 
the reader by selection of subsections.) Many important clas- 
sical mathematical themes are developed through this con- 
text, examples coming from single and multivariable calcu- 
lus, differential equations, numerical analysis, linear algebra 
and probability. The material is presented in such a way as to 
minimize the required background and to encourage the 
subsequent study of some of these fields. 

An elementary course for a general audience could be 
based entirely on Part I, and a higher level mathematics, sci- 


THE MATHEMATICAL ASSOCIATION OF AMERICA 


Mathematical 
Modeling 
in the Environment 


Charles Hadlock 


Series: Classroom Resource Materials 


ence, or engineering course could move quickly to Part 2. 

A PC compatible diskette packaged with the text con- 
tains a spreadsheet program that facilitates the numerical 
experimentation with the Gaussian plume equation intro- 
duced in Chapter 3, as well as public domain DOS program 
(ARCHIE) for evaluating the consequences from various haz- 
ardous materials scenarios (e.g., the physical extent of flam- 
mable and toxic vapor clouds). Text is not tied to the use of 
this software, but it is included as an aid to meet the peda- 
gogical objectives of the text. 
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Contains the complete solutions and further discussion of 
nearly every exercise presented in the textbook. This includes 
both the mathematical/computational exercises as well as the 
research questions and investigations. Readers will benefit 
greatly from perusing solutions to the problems whether they 
have worked them out themselves or not. Students using this 
volume will still need to work out solutions of research ques- 
tions using their own sources and adapting them to their own 
geographic locations, or using their own computational 
schemes, so this volume could well be useful for students in 
many course contexts. Enrichment material is included on 
the topics of some of the exercises. Advice for teachers who 
lack previous environmental experience, but who want to 
teach this material is also provided and makes it practical for 
such persons to offer a course based on these volumes. 
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Without question, Leonhard Euler (1707-1783) 
ranks among history's greatest mathematicians. 
Across six decades of unmatched productivity, and 
despite a visual impairment that grew ever worse, 
he charted the course of mathematics throughout 
the eighteenth century and beyond. His reputation 
is captured in Laplace’s famous admonition, “Read 
Euler, read Euler. He is the master of us all.” 


Written for the mathematically literate reader, this 
book provides a glimpse of Euler in action. 
Following an introductory biographical sketch are 
chapters describing his contributions to eight differ- 
ent topics—number theory, logarithms, infinite 
series, analytic number theory, complex variables, 
algebra, geometry, and combinatorics. Each chapter 
begins with a prologue to establish the historical _ 
context and then proceeds to a detailed considera- 
tion of one or more Eulerian theorems on the sub- 
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ject at hand. Each chapter concludes with an epi- 
logue surveying subsequent developments or 
addressing related questions that remain unan- 
swered to this day. At the end of the book is a brief 
outline of Euler’s collected works, the monumental 
Opera Omnia, whose publication has consumed vir- 
tually all of the twentieth century. 


In all, the book contains three dozen proofs from 
this remarkable individual. Yet this is merely the tip 
of the scholarly iceberg, for Euler produced over 
30,000 pages of pure and applied mathematics dur- 
ing his lifetime. Euler: The Master of Us All samples 
the work of a mathematician whose influence, 
industry, and ingenuity are of the very highest order. 
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A Problem Oriented Approach 
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Leads students quickly to the key ideas of combinatorics 


in a logical and proactive way .. . 


This book teaches the art of enumeration, or count- 
ing, by leading the reader through a series of car- 
fully chosen problems that are arranged strategically 
to introduce concepts in a logical order and in a 
provocative way. 


The format is unique in that it combines features of 
a traditional textbook with those of a problem 
book. It is organized in eight sections, the first four 
of which cover the basic combinatorial entities of 
strings, combinations, distributions and partitions. 
The last four cover the special counting methods of 
inclusion and exclusion, recurrence relations, gen- 
erating functions, and the method of Pélya and 
Redfield that can be characterized as “counting 
modulo symmetry.” The subject matter is presented 
through a series of approximately 250 problems 
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with connecting text where appropriate, and is sup- 
plemented by approximately 220 additional prob- 
lems for homework assignments. Many appli- 
cations to probability are included throughout 

the book. 


While intended primarily for use as a text for a 
college-level course taken by mathematics, comput- 
er science and engineering students, the book is 
suitable as well for a general education course at a 
good liberal arts college, or for self-study. 
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H. S. M. Coxeter’s classic book on non-Euclidean 
geometry was first published in 1942, and enjoyed 
eight reprintings before it went out of print in 1968. 
The MAA is delighted to be the publisher of the 
sixth edition of this wonderful book, updated with 
a new section 15.9 on the author's useful concept of 
inversive distance. 


Throughout most of this book, non-Euclidean 
geometries in spaces of two or three dimensions are 
treated as specializations of real projective geometry 
in terms of a simple set of axioms concerning points, 
lines, planes, incidence, order and continuity, with 
no mention of the measurement of distances or angles. 
This synthetic development is followed by the intro- 
duction of homogeneous coordinates, beginning 
with Von Staudt’s idea of regarding points as entities 
that can be added or multiplied. Transformations 
that preserve incidence are called collineations. 
They lead in a natural way to elliptic isometries or . 


Non-Euclidean 
Geometry 


H. S. M. COXETER 


“congruent transformations”. Following a recom- 
mendation by Bertrand Russell, continuity is 
described in terms of order. Elliptic and hyperbolic 
geometries are derived from real projective geome- 
try by specializing an elliptic or hyperbolic polarity 
which transforms points into lines (in two dimen- 
sions) or planes (in three dimensions) and vice versa. 


An unusual feature of the book is its use of the 
general linear transformation of coordinates to derive 
the formulas of elliptic and hyperbolic trigonometry. 
The area of a triangle is related to the sum of its 
angles by means of an ingenious idea of Gauss. This 
treatment can be enjoyed by anyone who is familiar 
with algebra up to the elements of group theory. 
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Series: Spectrum 


A perfect gift for the new teacher. . . or for anyone inter- 
ested in the teaching of mathematics. 


This book is the legacy of twenty years of mathematics 
teaching. During this time, the author searched for moti- 
vation techniques, mnemonics, insightful proofs, and 
serious applications of humor to aid his teaching. The 
result is this book: part philosophy, part humor, and part 
biography. Readers will be amused and enlightened on 


every page. 


Mr. Stueben shows how he has used humor and word- 
play to motivate his students. The book is filled with 
wonderful problems and proofs, as well as the author's 
insights about how to approach teaching problem solving 
to high school students. Sections of the book also treat 
the use of calculators and computers in the classroom. A 
section on mnemonics shows how teachers can use mem- 
ory aids to help their students learn and retain material. 
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Twenty Years Before 
the Blackboard 


The Lessons and Humor of a Mathematics Teacher 


Michael Stueben with Diane Sandford 


All in all, Twenty Years Before the Blackboard provides a 
goldmine of ideas for the classroom teacher. Although Mr. 
Stueben taught at the high school level, his book is an excel- 
lent “methods” book for mathematics teachers at all levels. 


Read what Martin Gardner has to say about this fascinat- 
ing book: 


It’s been decades since I read so entertaining a book about 
mathematics. The book is a treasure-trove of mathematical 
jokes, rhymes, anecdotes, word play, mnemonics, and beautiful 
proofs. For teachers there is an abundance of wise advice 
based on the author's twenty years in high school teaching. 
Mathematicians at all levels, from amateurs to college profes- 
sors will not only chuckle over its gems, but learn much they 
did not know before. ——Martin Gardner 


Catalog Code: TYB/JR98 
174 pp., Paperbound, 1998, ISBN 0-88385-525-9 
List: $29.50 MAA Member: $23.50 
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A Primer 
of Abstract 
Mathematics 


Robert Ash 


Series: Classroom Resource Materials 


A Primer of Abstract Mathematics prepares the reader Contents: _ 
to cope with abstract mathematics, specifically abstract Logic and Foundations: Truth Tables, Quantifiers, Proofs, 


algebra. It can serve as a text for prospective mathe- a Sean ee Oe te ene, 
matics majors, as well as for those students taking or eden igstaa cular atid ne gegen act 


Theorems, The Principle of Inclusion and Exclusion, 
preparing to take a first course in abstract algebra, or 


conn he ; Counting Infinite Sets. 
those in applied fields who need experience in dealing pie stitaty Number These The Riclidean alsodihen: 


with abstract mathematical ideas. Unique Factorization, Algebraic Structures, Further 
Properties of Congruence Modulo m, Linear Diophantine 


Learning any area of abstract mathematics involves writ- Equations and Simultaneous Congruences, Theorems of 


ing formal proofs, but it is equally important to think Euler and Fermat, The Mobius Inversion Formula. 
intuitively about the subject and to express ideas clearly some Highly Informal Set Theory: Well-Orderings, Zorn’s 
and cogently. The author aids intuition by keeping Lemma and the Axiom of Choice, Cardinal Numbers, 


proofs short and as informal as possible, using concrete | Addition and Multiplication of Cardinals. 

examples which illustrate all the features of the general Linear Algebra: Matrices, Determinants and Inverses, The 
case, and by giving heuristic arguments when a formal Vector Space F", Linear Independence and Bases, Subspaces, 
developmen t would take too long. The text can serve as Linear Transformations, Inner Product Spaces, Eigenvalues 


del on h h iaeat di and Eigenvectors. 
a model on how to write mathematics for an audience Tiesiy oF lnicax Opesascesordan Cancties loathe 


with limited experience in formalism and abstraction. Minimal and Characteristic Polynomials, The Adjoint of a 


Linear Operator, Normal Operators, The Existence of the 
Jordan Canonical Form. — 


Appendix: An Application of Linear Algebra. 


Ash introduces several expository innovations in A 
Primer of Abstract Mathematics. He presents an entirely 
informal development of set theory that gives students 
the basic results that they will need in algebra. The Catalog Code: BMAJR 

' 188 pp., Paperbound, 1998 
chapter which presents the theory of linear operators ISBN 0-88385-709-X 
introduces the Jordan Canonical Form right at the begin- — List: $27.95 MAA Member: $21.95 


ning, with a proof of existence at the end of the chapter. 
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AMERICAN MATHEMATICAL SOCIETY 


These bestselling selections provide vital and informative historical, biographical, and educational perspectives in 
the field of mathematics and are frequently adopted as course texts. Choose from the titles below or browse the 
AMS Bookstore at www.ams.org/bookstore/ to find more General Interest works that best meet your needs. 


Poincaré and the Three Body How to Teach Mathematics 
Problem Second Edition 


June Barrow-Green, The Open University, Steven G. Krantz, Washington University, St. Louis, MO 
Milton Keynes, UK Praise for the First Edition ... 


Delightful and inter ESUng IO ead “ will he Ip An original contribution to the educational literature on 
professors ... p rovide some vely Mieiestng teaching mathematics at the post-secondary level. The 
(and needed) histor ical background to their I6c- book itself is an explicit proof of the author's claim “teach- 
tures. Any serious student of mathematical ing can be rewarding, useful, and fun” 


history will enjoy this treatise. —Zentralblatt fur Mathematik 


—Applied Mechanics Reviews 

snosiencawih as ere eee This expanded edition of the original bestseller, How to 

aN TE | 0-published with the London Mathematica Teach Mathematics, offers hands-on guidance for teach- 

Goeth aa aaa ele o as ss tart ahead R ing mathematics in the modern classroom setting. Twelve 
‘ TOR Beas Bane Me emO et Pie Ne appendices have been added that are written by experts 


is registered with the Charity Commissioners. who have a wide range of opinions and viewpoints on the 
History of Mathematics, Volume 11; 1997; 272 pages; — major teaching issues. 


See od WH Fe MS 1999; 207 pages; Softcover; ISBN 0-8218-1398-6; List $24 
All AMS members $19; Order code HTM/2MM94 


What’s Happening in the 
Mathematical Sciences, 1998-1999 


Bary ope ; eek : It may be an enjoyable task for high school undergradu- 
Praise for What's Happening in the Mathematical ate mathematics students, their teachers and people 
Sciences interested in the field, to read the book and to learn from 
Another choice of new exciting developments in mathe- it by working on the challenging ideas which are provided 
matics. These volumes really deserve a large audience, throughout the text. : ; 
students as well as researchers will be fascinated by the —éentralblatt fir Mathematik 
insights and overviews presented. The purpose of this book is to teach the basic principles 


Techniques of Problem Solving 
Steven G. Krantz 


—Zentralblatt fiir Mathematik of problem solving, including both mathematical and non- 
Cover picture, detail from “Wave-Particle Duality’, was mathematical problems. A Solutions Manual to most 
created and provided by artist Mel Fisher. Reproduced with end-of-chapter exercises Is available. 
permission. 1997; 465 pages; Softcover; ISBN 0-8218-0619-X; List $29; 


What’s Happening in the Mathematical Sciences, Volume All AMS members $23; Order code TPSMM94 
4; 1999; 126 pages; Softcover; ISBN 0-8218-0766-8; List $14; 


Ord de HAPPENING/4MM94 | 
raer coae A Gentle Introduction 


to Game Theory 


The Man Who Loved Only Numbers g..4 stahy University of Kansas, Lawrence 


The Story of Paul Erdds and the Search This volume is based on courses given by the author at the 


for Mathematical Truth University of Kansas. The exposition is “gentle” because it 
Paul Hoffman requires only some knowledge of coordinate geometry; 

A publication of Hyperion Press. linear programming is not used. It is “mathematical” 
because it is more concerned with the mathematical solu- 
tion of games than with their applications. 

Mathematical World, Volume 13; 1999; 176 pages; Softcover: 
ISBN 0-8218-1339-0; List $25; All AMS members $20; Order 
code MAWRLD/13MM94 


This book opens.doors on a world and characters that are 
often invisible. It is interesting that Hoffman, Erdés and 
others in the book remember the mathematical tidbit that 
first intrigued them and bound them to this world. Possibly 
a future scientist or mathematician, or future scientific 
writer, will remember something in this book that way. 
—New York Times Book Review 
Distributed worldwide by the American Mathematical Society. 
1998; 302 pages; Hardcover; ISBN 0-7868-6362-5; List $23; OED 
All AMS members $16; Order code MLONMM94 AMERI 


& 
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Order from: American Mathematical Society, P.O. Box 5904, Boston, MA 02206-5904, USA. For credit card orders, fax 1-401-455-4046 or call toll free 1-800-321-4AMS (4267) in the 
U. S. and Canada, 1-401-455-4000 worldwide. Or place your order through the AMS bookstore at www.ams.org/bookstore/. Residents of Canada, please include 7% GST. 
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This book contains the best problems selected 
from over 25 years of the Problem of the Week 
at Macalester College. Readers will find here a 
collection of intriguing and thought provoking 
problems that will give students (high school or 
beyond), teachers, and university professors a 
chance to experience the pleasure of wrestling 
with some beautiful problems of elementary 
mathematics. 


Compare your sleuthing talents with those of 
Sherlock Holmes, who made a bad mistake 
regarding the first problem in the collection: 
Determine the direction of travel of a bicycle 
that has left its tracks in a patch of mud. The 
collection contains a variety of other unusual 
and interesting problems in geometry, algebra, 
combinatorics and number theory. For exam- 
ple, if a pizza is sliced into eight 45-degree 
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Which Way did the 
Bicycle Go? 


and Other Intriguing Mathematical Mysteries 


Joseph D. E. Konhauser, Dan Velleman, and Stan Wagon 


Series: Dolciani Mathematical Expositions 


wedges meeting at a point other than the center 


of the pizza, and two people eat alternate 
wedges, will they get equal amounts of pizza? 
Or: What is the rightmost nonzero digit of the 
product 1° 2: 3:-- 1000000? Or: Is a manufac- 
turer’s claim that a certain unusual combination 
lock allows thousands of combinations justified? 


complete solutions to the 191 problems are 
included along with problem variations and 
topics for investigation. This collection will be 
especially valuable to teachers who are looking 
for stimulating ways to engage their studerits 
with the beauty and intrigue that can often be 
found in elementary mathematics. 
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236 pp., Paperbound, 1996, ISBN 0-88385-325-6 
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